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CARD DOMAIN CONTAINING POLYPEPTIDES , ENCODING 
NUCLEIC ACIDS, AND METHODS OF USE 

This invention was made in part with U.S. 
Government support under NIH Grant No. GM61694 awarded 
5 by the National Institutes of Health. The U.S. 
Government has certain rights in this invention. 

BACKGROUND OF THE INVENTION 



FIELD OF THE INVENTION 



This invention relates generally to the 
10 fields of molecular biology and molecular medicine and 
more specifically to the identification of proteins 
involved in programmed cell death, cytokine processing 
and receptor signal transduction, and associations of 
these proteins. 

15 BACKGROUND INFORMATION 

Programmed cell death is a physiologic 
process that ensures homeostasis is maintained between 
cell production and cell turnover in essentially all 
self -renewing tissues.. In many cases, characteristic 
20 morphological changes, termed "apoptosis, " occur in a 
dying cell. Since similar changes occur in different 
types of dying cells, cell death appears to proceed 
through a common pathway in different cell types. 

In addition to maintaining tissue 
25 homeostasis, apoptosis also occurs in response to a 
variety of external stimuli, including growth factor 
deprivation, alterations in calcium levels, free- 
radicals, cytotoxic lymphokines, infection by some 
viruses, radiation and most chemotherapeutic agents. 
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Thus, apoptosis is an inducible event that likely is 
subject to similar mechanisms of regulation as occur, 
for example, in a metabolic pathway. In this regard, 
dysregulation of apoptosis also can occur and is 
5 observed, for example, in some types of cancer cells, 
which survive for a longer time than corresponding 
normal cells, and in neurodegenerative diseases where 
neurons die prematurely. In viral infections, 
induction of apoptosis can figure prominently in the 
10 pathophysiology of the disease process, because immune- 
based for eradication of viral infections depend on 
elimination of virus -producing host cells by immune 
cell attack resulting in apoptosis. 

Some of the proteins involved in programmed 
15 cell death have been identified and associations among 
some of these proteins have been described. However, 
additional apoptosis regulating proteins remain to be 
found and the mechanisms by which these proteins 
mediate their activity remains to be elucidated. The 
20 identification of the proteins involved in cell death 
and an understanding of the associations between these 
proteins can provide a means for manipulating the 
process of apoptosis in a cell and, therefore, 
selectively regulating the relative lifespan of a cell 
25 or its relative resistance to cell death stimuli. 

The principal effectors of apoptosis are a 
family of intracellular proteases known as Caspases, 
representing an abbreviation for Cysteine Aspartyl 
Proteases. Caspases are found as inactive zymogens in 
30 essentially all animal cells. During apoptosis, the 
caspases are activated by proteolytic processing at 
specific aspartic acid residues, resulting in the 
production of subunits that assemble into an active 
protease typically consisting of a heterotetramer 
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containing two large and two small subunits. The 
phenomenon of apoptosis is produced directly or 
indirectly by the activation of caspases in cells, 
resulting in the proteolytic cleavage of specific 
5 substrate proteins. Moreover, in many cases, caspases 
can cleave and activate themselves and each other, 
creating cascades of protease activation and mechanisms 
for "auto" -activation. Thus, knowledge about the 
proteins that interact with and regulate caspases is 

10 important for devising strategies for manipulating cell 
life and death in therapeutically useful ways. In 
addition, because capsases can also participate in 
cytokine activation and other processes, knowledge 
about the proteins that interact with caspases can be 

15 important for manipulating immune responses and other 
biochemical processes in useful ways. 

One of the mechanisms for regulating caspase 
activation involves protein-protein interactions 
mediated by a family of protein domains known as 

20 caspase recruitment domains (CARDs) . The 

identification of proteins that contain CARD domains 
and the elucidation of the proteins with which they 
interact, therefore, can form the basis for strategies 
designed to alter apoptosis, cytokine production, 

25 cytokine receptor signaling, and other cellular 

processes. Thus, a need exists to identify proteins 
that contain CARD domains. The present invention 
satisfies this need and provides additional advantages 
as well. 

3 0 SUMMARY OF THE INVENTION 



The invention provides caspase recruitment 
domain (CARD) -containing polypeptides, and CARD, 
NB-ARC, ANGIO-R, LRR and SAM domains therefrom. Also 
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provided are chimeric polypeptides containing a CARD, 
NB-ARC, ANGIO-R, LRR or SAM domain of a CARD- containing 
polypeptide. Methods of producing CARD- containing 
polypeptides, and compositions containing 
5 CARD- containing polypeptides and a pharmaceutical^ 
acceptable carrier, are also provided. 

The invention further provides nucleic acid 
molecules encoding CARD- containing polypeptides and 
CARD, NB-ARC, ANGIO-R, LRR and SAM domains therefrom. 
10 Also provided are antibodies directed against such 
polypeptides . 

The invention also provides methods for 
identifying a nucleic acid molecule encoding a 
CARD- containing polypeptide, and methods for detecting 
15 the presence of a CARD-containing polypeptide in a 
sample . 



Further provided are methods of identifying a 
CARD-associated polypeptide (CAP) , and methods of 
identifying an effective agent that alters the 
association of a CARD-containing polypeptide with a 
CAP. The invention also provides methods of 
identifying an effective agent that modulates an 
activity of a NB-ARC domain of a CARD-containing 
polypeptide . 



20 



2 5 The invention also provides methods of 

altering the level of a biochemical process modulated 
by a CARD-containing polypeptide. 



The invention further provides methods of 
treating a pathology characterized by abnormal cell 
30 proliferation, abnormal cell death, or inflammation. 
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Also provided are methods of diagnosing or 
predicting clinical prognosis of a pathology 
characterized by an increased or decreased level of a 
CARD-containing polypeptide in a subject. 

5 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A shows the genomic organization of 
the CLAN (CARD 4/5X) gene on chromosome 2 deduced from 
the BAC 164M19 sequence from the SPG4 candidate region 

10 at 2p21-2p22 (Accession No. AL121653) and Homo sapiens 
chromosome 2 working draft sequence (Accession No. 
NT_005194.1) . Figure IB shows mRNA splicing generating 
CLAN A, B, C and D. Figure 1C shows the deduced domain 
structure for the splice forms of CARD4/5X (CLAN A, B, 

15 C and D) . 

Figure 2 shows an alignment of the protein 
sequence of the isoforms of CLAN (designated CLAN A, B, 
C and D; SEQ ID NOS:97, 99, 103 and 101, respectively). 
Dark boxes indicate identical amino acids, and white 
2 0 , boxes indicate conserved amino acids. 

Figure 3 shows the amino acid sequences of 
the CARD -A, CARD-B and NB-ARC domains of CARD3X (SEQ ID 
NOS: 170, 172 and 174, respectively) . 

Figure 4 shows an alignment of COP-1 (SEQ ID 
25 NO:86) and caspase-1 (SEQ ID NO:87). The amino acids 
shaded in black are identical. 



Figure 5 shows an alignment of COP-2 (SEQ ID 
NO:90) and caspase-1 (SEQ ID NO:87), with the consensus 
sequence (SEQ ID NO: 91) shown above the aligned 
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sequences . 
identical . 
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The amino acids shaded in black are 



Figure 6 shows IL-lfJ secretion by COS7 ceils 
transfected with the indicated amounts of expression 
5 vectors encoding the indicated proteins. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel 
polypeptides involved in programmed cell death, or 
apoptosis. The principal effectors of apoptosis are a 

10 family of intracellular cysteine asp artyl prote ases , 
known as caspases. Caspase activity in the cell is 
regulated by protein-protein interactions. Similarly, 
protein-protein interactions influence the activity of 
other proteins involved in apoptosis. Several protein 

15 interaction domains have been implicated in 

interactions among some apoptosis-regulating proteins. 
Among these is the caspase recruitment domain, or CARD- 
containing polypeptide which are so named for the 
ability of the CARD- containing polypeptides to bind 

20 caspases. In addition to their ability to bind 

caspases, numerous CARD- containing polypeptides bind 
other proteins, particularly, other CARD-containing 
polypeptides. Further, CARD-containing polypeptides 
influence a variety of cellular and biochemical 

2 5 processes beyond apoptosis, including cell adhesion, 
inflammation and cytokine receptor signaling. 

In accordance with the present invention, 
there are provided isolated CARD-containing 
polypeptides or functional fragments thereof, 
30 comprising substantially the same amino acid sequence 
as set forth in any of SEQ ID NOS : 12, 168, 188, 170, 



WO 01/90156 PCT/US01/17158 

7 

172, 174, 176, 97, 99, 101, 103, 178, 180, 182, 184, 86 
and 90. 

The sequence identifiers set forth above 
correspond to the molecules described herein as set 
5 forth in Table 1. 



Table 1 



Designation 


Nucleotide 

oil*U -LJJ 1MU : 


Polypeptide 

biiU ID JMU : 


p7\ pnov 


1 1 




^-rllvU^ A. L^jfiuXL/ J-/C/11 Id _L 11 


-Lo / 


-Lob 




1 ft 1 
±0 / 


loo ana ± o y 


PZ\Pn^y PAPT^k rinma-in 


1 £ Q 
± 0 z> 


X /u 


CARD3X CARDB Domain 


111 


1 79 


CARD 3 X MB -ARC Domain 


173 


174 


CARD3X ANGIO-R Domain 


175 


176 


CLAN A 


96 


97 


CLAN B 


98 


99 


CLAN C 


100 


101 


CLAN D 


102 


103 


CLAN CARD 


177 


178 


CLAN NACHT 


179 


180 


CLAN LRR 


181 


182 


CLAN SAM 


183 


184 


COP1 


85 


86 


COP2 


89 


90 



25 The terms w CARD-containing protein" or "CARD- 

containing polypeptide 7 ' as used herein refer to a 
protein or polypeptide containing a CARD domain. As 
used herein, the term U CARD domain" refers to a Caspase 
Recruitment Domain. A CARD domain is a well known 
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protein domain of approximately 80 amino acids with 
characteristic sequence conservation as described, fox- 
example, in Hofmann et al . , Trends Biochem. Sci . 
22:155-156 (1997). CARD domains have been found in 
5 some members of the Caspase family of cell death 

proteases. Caspases-1, 2, 4, 5, 9, and 11 contain CARD 
domains near their NH2-termini. These CARD domains 
mediate interactions of the zymogen inactive forms of 
caspases with other proteins which can either activate 
10 or inhibit the activation of these enzymes. 



For example, the CARD domain of pro-caspase- 9 
binds to the CARD domain of a caspase-act ivating 
protein called Apaf-1 (Apoptosis Protease Activating 
Factor- 1) . Similarly, the CARD domain of pro-caspase-1 

15 permits interactions with another CARD protein known as 
Cardiac (also referred to as RIP2 and RICK) , which 
results in activation of the caspase-1 protease (Thome 
et al., Curr. Biol. 16:885-888 (1998)). Additionally, 
pro-caspase-2 binds to the CARD protein Raidd (also 

2 0 know as Cradd) , which permits recruitment of 

pro-caspase-2 to Tumor Necrosis Factor (TNF) Receptor 
complexes and which results in activation of the 
caspase-2 protease (Ahmad et al . , Cancer Res. 
57:615-619 (1997)). CARD domains can also participate 

25 in homotypic interactions with themselves, resulting in 
self -association of polypeptides that contain these 
protein- interaction domains and producing dimeric or 
possibly even oligomeric complexes. 



CARD domains can be found in association with 
3 0 other types of functional domains within a single 

polypeptide, thus providing a mechanism for bringing a 
functional domain into close proximity or contact with 
a target protein via CARD: CARD associations involving 
two CARD- containing polypeptides. For example, the 
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Caenorhabiditis elegans cell death gene ced-4 encodes a 
protein that contains a CARD domain and a ATP-binding 
oligomerization domain called an NB-ARC domain (van der 
Biezen and Jones, Curr. Biol. 8:R226-R227) . The CARD 
5 domain of the CED-4 protein interacts with the CARD 
domain of a pro-caspase called CED-3. The NB-ARC 
domain allows CED-4 to self -associate, thereby forming 
an oligomeric complex which brings associated pro-CED-3 
molecules into close proximity to each other. Because 

10 most pro- caspases possess at least a small amount of 
protease activity even in their unprocessed form, the 
assembly of a complex that brings the proforms of 
caspase into juxtaposition can result in 
trans-processing of zymogens, producing the 

15 proteolytically processed and active caspase. Thus, 
CED-4 employs a CARD domain for binding a pro -caspase 
and an NB-ARC domain for self -oligomerization, 
resulting in caspase clustering, proteolytic processing 
and activation. 



2 0 In addition to their role in caspase 

activation, CARD domains have been implicated in other 
cellular processes. Some CARD- containing polypeptides, 
for example, induce activation of the transcription 
factor NF-kB. NF-kB activation is induced by many 
25 cytokines and plays an important role in cytokine 

receptor signal transduction mechanisms (DiDonato et 
al., Nature 388:548-554 (1997)). Moreover, CARD 
domains are found in some proteins that inhibit rather 
than activate caspases, such as the IAP (Inhibitor of 

3 0 Apoptosis Protein) family members, cIAPl and cIAP2 

(Rothe et al . , Cell 83:1243-1252 (1995)) and oncogenic 
mutants of the Bcl-10 protein (Willis et al . , Cell 
96:35-45 (1999)). Also, though caspase activation 
resulting from CARD domain interactions is often 
35 involved in inducing apoptosis, other caspases are 
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primarily involved in proteolytic processing and 
activation of inflammatory cytokines (such as pro-IL-lb 
and pro-IL-18) . Thus, CARD-containing polypeptides can 
also be involved in cytokine receptor signaling and 
5 cytokine production, and, therefore, can be involved in 
regulation of immune and inflammatory responses. 

In view of the function of the CARD domain 

within the invention CARD-containing polypeptides or 

functional fragments thereof, polypeptides of the 
10 invention are contemplated herein for use in methods to 

alter biochemical processes such as apoptosis, NF-kB 

induction, cytokine processing, cytokine receptor 

signaling, caspase-mediated proteolysis, thus having 

modulating effects on cell life and death (i.e., 
15 apoptosis) , inflammation, cell adhesion, and other 

cellular and biochemical processes. 

Invention CARD-containing polypeptides or 
functional fragments thereof (including CARD domains) 
are also contemplated in methods to identify CARD- 

2 0 binding agents and CARD-associated polypeptides (CAPs) 

that alter apoptosis, NF-kB induction, cytokine 
processing, cytokine receptor signaling, caspase- 
mediated proteolysis, thus having modulating effects on 
cell life and death (i.e., apoptosis), inflammation, 
25 cell adhesion, and other cellular and biochemical 
processes . 

It is also contemplated herein that invention 
CARD-containing polypeptides can associate with other 
CARD-containing polypeptides to form invention hetero- 

3 0 oligomers or homo -oligomers , such as heterodimers or 

homodimers. In particular, the association of the CARD 
domain of invention polypeptides with other CARD- 
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containing polypeptides, such as Apaf-1, CED-4, 
caspases-1, 2, 9, 11, cIAPs-1 and 2, CARDIAK, Raidd, 
Dark, CLAN, other invention CARD -containing 
polypeptides, and the like, including homo- 
5 oligomerization, is sufficiently specific such that the 
bound complex can form in vivo in a cell or in vitro 
under suitable conditions. Similarly therefore, an 
invention CARD- containing polypeptide can associate 
with another CARD -containing polypeptide by CARD: CARD 
10 form invention hetero-oligomers or homo -oligomers , such 
as heterodimers or homodimers . 



In accordance with the present invention, 
sequences for novel CARD- containing polypeptides have 
been determined. Thus, the present invention provides 
15 novel CARD- containing polypeptides, including the newly 
identified CARD -containing polypeptides designated 
CARD2X, CARD3X, CLAN A, CLAN B, CLAN C, CLAN D, COP-1 
and COP-2 (set forth in SEQ ID NOS : 12, 188, 97, 99, 
101, 103, 86 and 90) . 

20 In addition to CARD domains, invention 

polypeptides can contain one or more additional 
domains . The locations within the reference sequence 
of the domains described herein are set forth in 
Table 2 . 
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Table 2 



Domain 


Corresoondincf amino 


SEQ ID 


acids 


NO: 


CARD 2 X 
CARD Domain 


4-78 of SEQ ID NO:12 


167 (nt) 

168 (aa) 


CARD 3 X 

CARDA Domain 


2-78 of SEQ ID NO:107 


169 (nt) 

170 (aa) 


CARD 3 X 

CARDB Domain 


105-185 of SEQ ID 
NO: 107 


171 (nt) 

172 (aa) 


CARD3X 

NB-ARC Domain 


265-560 of SEQ ID 
NO:107 


173 (nt) 

174 (aa) 


CARD 3 X 

ANGIO-R Domain 


437-839 of SEQ ID 
NO:107 


175 (nt) 

176 (aa) 


CLAN 

CARD Domain 


1-87 of SEQ ID NO:97 


177 (nt) 

178 (aa) 


CLAN 

NACHT Domain 


161-457 of SEQ ID 
NO: 97 


179 (nt) 

180 (aa) 


CLAN 

LRR Domain 


760-965 of SEQ ID 
NO: 97 


181 (nt) 

182 (aa) 


CLAN 

SAM Domain 


642-696 of SEQ ID 
NO: 97 


183 (nt) 

184 (aa) 



10 



15 



20 



CARD3X (SEQ ID NO: 88) contains at least four 
distinct domains: two CARD domains, designated CARD -A 
and CARD-B, an NB-ARC domain and an angio-R domain. A 
second in- frame, open reading frame that begins after a 

25 stop codon encodes a domain with several leucine rich 
repeats (LRR) (SEQ ID NO: 189) (see Example) . An 
invention CARD 3 X polypeptide can thus contain the amino 
acid sequence designated SEQ ID NO: 188 and the amino 
acid sequence designated SEQ ID NO: 189; contain SEQ ID 

30 NO: 188 but not SEQ ID NO: 189; or contain SEQ ID NO: 189 
but not SEQ ID NO: 188. A murine CARD3X polypeptide can 
contain the amino acid sequence designated SEQ ID 
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NO: 193, which is homologous to a portion of the human 
CARD3X ANGIO-R domain, with or without one or more 
additional CARD 3 X domains. 



CLAN exists in four isoforms (see Example) , 
5 each of which contains a CARD domain. The longest 
isoform, CLAN-A, also contains an NB-ARC (NACHT) 
domain, a LRR domain and a SAM domain. CLAN represents 
a new member of the CED-4 related protein family. 
Numerous CED-4 -related proteins have recently been 

10 identified. These proteins belong to the CED-4 family 
of proteins, and include CED-4 (Yuan and Horvitz, 
Development 116:309-320 (1992)), Apaf-1, (Zou et al . , 
Cell 90:405-413 (1997)), Dark (Rodriguez et al . , Nature 
Cell Biol . 1:272-279 (1999)), and CARD4/ Nodi (Bertinet 

15 al., J. Biol. Chem . 274:12955-12958 (1999) and Inohara 
et al., J. Biol. Chem . 274:14560-14567 (1999)). As 
used herein, a "CED-4 family" member or "CED-4 protein 
family" member, also referred to herein as a "NAC" 
polypeptide, is a polypeptide that comprises a NB-ARC 

2 0 domain and a CARD domain. 



The CED-4 homolog in humans and rodents, 
referred to as Apaf-1, contains a (i) CARD domain, (ii) 
NB-ARC domain, and (iii) multiple copies of a WD-repeat 
domain. In contrast to CED-4 which can spontaneously 

2 5 oligomerize, the mammalian Apaf-1 protein is an 

inactive monomer until induced to oligomerize by 
binding of a co- factor protein, cytochrome c (Li et 
al., Cell 91:479-489 (1997)). In Apaf-1, the WD repeat 
domains prevent oligomerization of the Apaf-1 protein, 

3 0 until coming into contact with cytochrome c. Thus, the 

WD-repeats function as a negative -regulatory domain - 
that maintains Apaf-1 in a latent state until 
cytochrome c release from damaged mitochondria triggers 
the assembly of an oligomeric Apaf-1 complex (Saleh, 
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Biol. Chem. 274:17941-17945 (1999)). By binding 
pro-caspase-9 through its CARD domain, Apaf-1 
oligomeric complexes are thought to bring the zymogen 
forms of caspase-9 into close proximity, permitting 
5 them to cleave each other and produce the proteolytic 
processed and active caspase-9 protease (Zou et al . , J. 
Biol. Chem. 274:11549-11556 (1999) ) . 



Another characteristic of the invention 
CARD- containing polypeptides is that they can associate 

10 with pro-caspases, caspases or with caspase-associated 
proteins, thereby altering caspase proteolytic 
activity. Caspase proteolytic activity is associated 
with apoptosis of cells, and additionally with cytokine 
production. Therefore, an invention CARD -containing 

15 polypeptide can alter apoptosis or cytokine production 
by altering caspase proteolytic activity. As used 
herein a n caspase" is any member of the cysteine 
aspartyl proteases. Typically, as caspase can 
associate with a CARD -containing polypeptide of the 

20 invention such as a NAC polypeptide. Similarly, a 

"pro-caspase" is an inactive or less-active precursor 
form of a caspase, which is typically converted to the 
more active caspase form by a proteolytic event, and 
often a proteolytic event preceded by a protein:protein 

25 interaction such as a CARD: CARD interaction, and the 
like . 



As described in the Example, COP-1 interacts 
with the prodomain of pro-caspase-1 and also with RIP2, 
a protein previously demonstrated to bind the prodomain 
30 of pro-caspase-1. COP-1 competes with RIP2 for binding 
to pro-caspase-1, thereby inhibiting RIP2-mediated 
caspase-1 oligomerization. Consequently, COP-1 
interferes with the ability of RIP2 to enhance 
caspase-l-induced secretion of mature IL- 1(3. 
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Therefore, COP-1 is likely to play a role in 
controlling IL-1(3 generation and thereby opposing 
IL-lfS-induced inflammation. IL-1|B plays a critical 
role in septic shock, which currently represents the 
5 most common cause of lethality in patients treated in 
the intensive care setting. Accordingly, COP-1 likely 
plays a role in IL-lp homeostasis to prevent systemic 
inflammatory reactions when challenged with 
gram-negative bacteria or other inflammatory insults. 



10 As also described in the Example, because of 

their interactions with diverse other CARD proteins, 
the isoforms of CLAN (A, B, C and D) likely influence 
apoptosis, cytokine processing, or NF-kB activity. 
Interactions of CLAN with pro-caspase-1 likely 

15 indicates a role for CLAN as a IL-ip regulator. In 
this regard, different isoforms of CLAN likely have 
opposing effects on pro-caspase-1 activation. The 
longest isoform, CLAN-A, for example, can trigger pro- 
caspase-1 activation by the "induced proximity 7 ' 

2 0 mechanism as a result of oligomerization mediated by 

its NB-ARC (NACHT) domain. In contrast, the shorter 
isoforms of CLAN lacking this self -oligomerization can 
operate as competitive antagonists of pro-caspase-1 
activation, analogous to ICEBERG, a CARD -containing 
25 protein that competes with CARDIAK (RIP2/RICK) for 
binding to pro-caspase-1. Interactions of CLAN with 
NAC also suggest this protein can have an influence on 
apoptosis mediated by Apaf-1, in as much as NAC binds 
Apaf-1 and enhances its ability to activate caspase-9 

3 0 in response to cytochrome c. Finally, CLAN 

associations with NF-kB regulators such as Bel -10 and 
Nod2 strongly suggest that at least some of the CLAN 
isoforms can influence the activity of this 
transcription factor. 
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In addition to the ability to bind caspases, 
invention CARD-containing polypeptides can contain a 
protease domain/ such as a protease domain found in 
caspase, and the like. A caspase protease domain 
5 hydrolyzes amide bonds, particularly the amide bond of 
a peptide or polypeptide backbone. Typically, a 
caspase protease domain contains a P2 0/P10 domain in 
the active site region of the caspase protease domain. 
Thus, a caspase protease domain has proteolytic 
10 activity. 

CARD-containing polypeptides are also known 
to induce activation of the transcription factor NF-kB. 
Thus, an invention CARD-containing polypeptide can also 
alter transcription by, for example, modulation of 
15 NF-kB activity, and the like. 

The NB-ARC (NACHT) domain of invention NAC 
polypeptides such as CLAN and CARD 3 X (see Example) 
associates with other polypeptides, particularly with 
polypeptides comprising NB-ARC domains. Thus, a 

2 0 functional NB-ARC domain associates with NB-ARC 

domain-containing polypeptides by way of NB-ARC :NB-ARC 
association. As used herein, the term "associate" or 
"association" means that CARD-containing polypeptide 
such as a NAC polypeptide can bind to a polypeptide 
25 relatively specifically and, therefore, can form a 

bound complex. For example, the association of a CARD 
domain of an invention CARD-containing polypeptide with 
another CARD-containing polypeptide or the association 
of a NB-ARC domain of NAC with another NB-ARC 

3 0 domain- containing polypeptides is sufficiently specific 

such that the bound complex can form in vivo in a cell 
or in vitro under suitable conditions. 
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Further, a NB-ARC domain demonstrates both 
nucleotide -binding (e.g., ATP-binding) and hydrolysis 
activities, which is typically required for its ability 
to associate with NB-ARC domain-containing 
5 polypeptides. Thus, an NB-ARC domain of the invention 
NAC comprises one or more nucleotide binding sites. As 
used herein, a nucleotide binding site is a portion of 
a polypeptide that specifically binds a nucleotide such 
as, e.g., ADP, ATP, and the like. Typically, the 

10 nucleotide binding site of NB-ARC will comprise a P- 
loop, a kinase 2 motif, or a kinase 3a motif of the 
invention NAC (these motifs are defined, for example, 
in van der Biezen and Jones, supra ) . Preferably, the 
nucleotide binding site of NB-ARC comprises a P-loop of 

15 the invention NAC. The NB-ARC domain of the an 

invention CARD-containing polypeptide, therefore, is 
capable of associating with other NB-ARC domains in 
homo- or hetero-oligormerization. Additionally, the 
NB-ARC domain is characterized by nucleotide hydrolysis 

20 activity, which can influence the ability of an NB-ARC 
domain to associate with another NB-ARC domain. 



An invention NAC, therefore, is capable of 
CARD: CARD association and/or NB-ARC :NB-ARC association, 
resulting in a multifunctional polypeptide capable of 

25 one or more specific associations with other 
polypeptides. An invention NAC can alter cell 
processes such as apoptosis, cytokine production, and 
the like. For example, it is contemplated herein that 
an invention NAC polypeptide can increase the level of 

3 0 apoptosis in a cell. It is also contemplated herein 
that an invention NAC can decrease the level of 
apoptosis in a cell. For example, a NAC which does not 
induce apoptosis may form hetero-oligomers with a NAC 
which is apoptotic, thus interfering with the 

35 apoptosis -inducing activity of NAC. 
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In another embodiment of the invention, a 
CARD-containing polypeptide of the invention, such as 
CLAN (SEQ ID NOS:96, 98, 100 and 102) and an isoform of 
CARDS X (containing SEQ ID NO:i89) also contains 
5 Leucine-Rich Repeats (LRR) domain. LRR domains are 
well known in the art and, in one embodiment, the LRR 
domain of an invention CARD-containing polypeptide has 
substantially the same sequence as a LRR described in 
another CARD-containing polypeptide known as Nodi 
10 (Inohara et al . , J. Biol. Chem. 274:14560-14567 

(1999)). The function of the LRR domain is to mediate 
specific interactions with other polypeptides. 

In another embodiment of the invention, there 
are provided CARD-containing polypeptides that contain 
an NB-ARC domain and a CARD domain. NAC polypeptide 
sequences disclosed herein, for example, CARD4/5X 
(CLAN) , modulate a variety of biochemical processes 
such as apoptosis. NAC polypeptides can also have 
other domains that modulate biochemical processes such 
as an LRR domain or a WD domain. 



15 



20 



Those of skill in the art will recognize that 
numerous residues of the above -described sequences can 
be substituted with other, chemically, sterically 
and/or electronically similar residues without 

25 substantially altering the biological activity of the 
resulting CARD-containing polypeptide species. In 
addition, larger polypeptide sequences comprising 
substantially the same sequence as amino acids set 
forth in SEQ ID NOS.-12, 168, 188, 170, 172, 174, 176, 

30 97, 99, 101, 103, 178, 180, 182, 184, 86 and 90, 
therein are contemplated within the scope of the 
invention. 



\ 
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As employed herein, the term "substantially 
the same amino acid sequence" refers to amino acid 
sequences having at least about 70% or 75% identity 
with respect to the reference amino acid sequence, and 
5 retaining comparable functional and biological activity 
characteristic of the polypeptide defined by the 
reference amino acid sequence. Preferably, 
polypeptides having "substantially the same amino acid 
sequence" will have at least about 80%, 82%, 84%, 86% 

10 or 88%, more preferably 90%, 91%, 92%, 93% or 94% amino 
acid identity with respect to the reference amino acid 
sequence; with greater than about 95%, 96%, 97%, 98% or 
99% amino acid sequence identity being especially 
preferred. It is recognized, however, that 

15 polypeptides or nucleic acids containing less than the 
described levels of sequence identity arising as splice 
variants or that are modified by conservative amino 
acid substitutions, or by substitution of degenerate 
codons are also encompassed within the scope of the 

2 0 present invention „ 



In accordance with the invention, 
specifically included within the definition of 
substantially the same amino acid sequence is the 
predominant amino acid sequence of a particular 

25 invention CARD- containing polypeptide or domain 

disclosed herein. The predominant amino acid sequence 
refers to the most commonly expressed naturally 
occurring amino acid sequence in a species population. 
A predominant polypeptide with multiple isoforms will 

30 have the most commonly expressed amino acid sequence 
for each isoform. A predominant CARD- containing 
polypeptide of the invention refers to an amino acid 
sequence having sequence identity to an amino acid 
sequence disclosed herein that is greater than that of 
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any other naturally occurring protein of a particular 
species (e.g., human). 



Given the teachings herein of the location 
and nucleic acid or amino acid sequences corresponding 
5 to the invention CARD-containing polypeptides, one of 
skill in the art can readily confirm and, if necessary, 
revise the nucleic acid or amino acid sequences 
associated with the CARD-containing polypeptides of the 
invention. For example, the sequences can be confirmed 

10 by probing a cDNA library with a nucleic acid probe 

corresponding to a nucleic acid of the invention using 
PCR or other known methods. Further, an appropriate 
bacterial artificial chromosome containing the region 
of the genome encoding an invention CARD-containing 

15 polypeptide can be commercially obtained and probed 
using PCR, restriction mapping, sequencing, and other 
known methods . 



The term "biologically active" or 
"functional", when used herein as a modifier of 

20 invention CARD-containing polypeptides, or polypeptide 
fragments thereof, refers to a polypeptide that 
exhibits functional characteristics similar to a CARD- 
containing polypeptide of the invention. Biological 
activities of a CARD-containing polypeptide include 

25 for example, the ability to bind, preferably in vivo, 

to a nucleotide, to a CARD-associated polypeptide, to a 
NB -ARC -containing polypeptide, or to homo- oligomer ize, 
or to alter protease activation, particularly caspase 
activation, or to catalyze reactions such as 

30 proteolysis or nucleotide hydrolysis, or to alter NF-kB 
activity, or to alter apoptosis, cytokine processing, 
cytokine receptor signaling, inflammation, immune 
response, and other biological activities described 
herein. 
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The ability of a CARD- containing polypeptide 
to bind another polypeptide such as a CARD-associated 
polypeptide can be assayed, for example, using the 
methods well known in the art such as yeast two-hybrid 
5 assays, co-immunoprecipitation, GST fusion co- 
purification, and other methods provided in standard 
technique manuals such as Sambrook, supra , and Ausubel 
et al., supra . Another biological activity of a CARD- 
containing polypeptide is the ability to act as an 

10 immunogen for the production of polyclonal and 

monoclonal antibodies that bind specifically to an 
invention CARD- containing polypeptide. Thus, an 
invention nucleic acid encoding a CARD- containing 
polypeptide can encode a polypeptide specifically 

15 recognized by an antibody that also specifically 

recognizes a CARD- containing polypeptide (preferably 
human) including the amino acid set forth in SEQ ID 
NOS: 12, 168, 188, 170, 172, 174, 176, 97, 99, 101, 
103, 178, 180, 182, 184, 86 and 90. Such immunologic 

2 0 activity may be assayed by any method known to those of 
skill in the art. For example, a test -polypeptide can 
be used to produce antibodies, which are then assayed 
for their ability to bind to an invention polypeptide. 
If the antibody binds to the test -polypeptide and to 

2 5 the reference polypeptide with substantially the same 

affinity, then the polypeptide possesses the requisite 
immunologic biological activity. 

As used herein, the term "substantially 
purified" means a polypeptide that is in a form that is 

3 0 relatively free from contaminating lipids, 

polypeptides, nucleic acids or other cellular material 
normally associated with a polypeptide in a cell. A 
substantially purified CARD- containing polypeptide can 
be obtained by a variety of methods well-known in the 
35 art, e.g., recombinant expression systems described 
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herein, chemical synthesis or purification from native 
sources. Purification methods can include, for 
example, precipitation, gel filtration, ion-exchange, 
reverse-phase and affinity chromatography, and the 
5 like. Other well-known methods are described in 
Deutscher et al . , "Guide to Protein Purification" 
Methods in Enzymology Vol. 182, (Academic Press, 
(1990) ) . Alternatively, the isolated polypeptides of 
the present invention can be obtained using well-known 

10 recombinant methods as described, for example, in 
Sambrook et al . , supra , (1989) and Ausubel et al . , 
supra (2000) . The methods and conditions for 
biochemical purification of a polypeptide of the 
invention can be chosen by those skilled in the art, 

15 and purification monitored, for example, by an 

immunological assay, binding assay, or a functional 
assay. 



In addition to the ability of invention 
CARD- containing polypeptides, or functional fragments 

20 thereof, to interact with other, heterologous proteins 
(e.g., CARD- containing polypeptides), invention 
CARD-containing polypeptides have the ability to 
self -associate to form invention homo -oligomers such as 
homodimers . This self -association is possible through 

2 5 interactions between CARD domains, and also through 
interactions between NB-ARC domains. Further, 
self -association can take place as a result of 
interactions between LRR domains. 



In accordance with the invention, there are 
3 0 also provided mutations and fragments of CARD- 

containing polypeptides which have activity different 
than a predominant naturally occurring CARD- containing 
polypeptide activity. As used herein, a "mutation" can 
be any deletion, insertion, or change of one or more 



WO 01/90156 PCT/US01/17158 

23 

amino acids in the predominant naturally occurring 
protein sequence (e.g., wild-type), and a "fragment" is 
any truncated form, either carboxy- terminal , 
amino- terminal, or both, of the predominant naturally 
5 occurring protein. Preferably, the different activity 
of the mutation or fragment is a result of the mutant 
polypeptide or fragment maintaining some but not all of 
the activities of the respective predominant naturally 
occurring CARD- containing polypeptide. 

10 For example, a functional fragment of an 

invention polypeptide can contain or consist of one or 
more of the following: a CARD domain, a NB-ARC domain, 
a LRR domain, a SAM domain, or an angio-R domain. In a 
specific example, a fragment of a CARD- containing 

15 polypeptide such as CLAN can contain a CARD domain and 
LRR domain, but lack a functional NB-ARC domain. Such 
a fragment will maintain a portion of the predominant 
naturally occurring CLAN activity (e.g., CARD domain 
functionality), but not all such activities (e.g., 

2 0 lacking an active NB-ARC domain) . The resultant 

fragment will therefore have an activity different than 
the predominant naturally occurring CLAN activity. In 
another example, the CLAN polypeptide might have only 
the NB-ARC domain, allowing it to interact with other 

25 NB-ARC domain proteins in forming homo -oligomers or 
hetero-oligomers . In one embodiment, the activity of 
the fragment will be * dominant - negat ive . " A dominant- 
negative activity will allow the fragment to reduce or 
inactivate the activity of one or more isoforms of a 

30 predominant naturally occurring CARD- containing 

polypeptide. Another functional fragment can include 
an angio-R domain (see Example) , or any of the domains 
disclosed herein (see, for example, Table 2) . 
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Isoforms of the CARD- containing polypeptides 
are also provided which arise from alternative mRNA 
splicing and may alter or modify the interactions of 
the CARD- containing polypeptide with other 
5 polypeptides. For example, four isoforms of CLAN and 
three isoforms of CARD 3 X are disclosed herein. 
Additional isoforms of the CARD- containing polypeptides 
designated SEQ ID NOS : 12, 188, 97, 99, 101, 103, 86 
and 90, are contemplated herein and therefore, are 
10 encompassed within the scope of the invention 
CARD-containing polypeptides. 

Methods to identify polypeptides containing a 
functional fragment of a CARD-containing polypeptide of 
the invention are well known in the art and are 

15 disclosed herein. For example, genomic or cDMA 

libraries, including universal cDNA libraries can be 
probed according to methods disclosed herein or other 
methods known in the art. Full-length polypeptide 
encoding nucleic acids such as full-length cDNAs can be 

20 obtained by a variety of methods well-known in the art. 
For example, 5' and 3' RACE, methodology is well known 
in the art and described in Ausubel et al . , supra , and 
the like. 

In another embodiment of the invention, 
25 chimeric polypeptides are provided comprising a CARD- 
containing polypeptide, or a functional fragment 
thereof, fused with another protein or functional 
fragment thereof. Functional fragments of a CARD- 
containing polypeptide include, for example, NB-ARC 
3 0 (NACHT) , CARD, LRR, and ANGIO-R domains or other 
fragments that retain a biological activity of an 
invention CARD-containing polypeptide. Polypeptides 
with which the CARD-containing polypeptide or 
functional fragment thereof are fused will include, for 
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example, glutathione-S- transferase, an antibody, or 
other proteins or functional fragments thereof which 
facilitate recovery of the chimera. Further, 
polypeptides with which a CARD- containing polypeptide 
5 or functional fragment thereof are fused will include, 
for example, lucif erase, green fluorescent protein, an 
antibody, or other proteins or functional fragments 
thereof which facilitate identification of the chimera. 
Still further polypeptides with which a CARD- containing 
10 polypeptide or functional fragment thereof are fused 

will include, for example, the LexA DNA binding domain, 
ricin, a-sarcin, an antibody or fragment thereof, or 
other polypeptides which have therapeutic properties or 
other biological activity. 

15 Further invention chimeric polypeptides 

contemplated herein are chimeric polypeptides wherein a 
functional fragment of a CARD -containing polypeptide is 
fused with a catalytic domain or a protein interaction 
domain from a heterologous polypeptide. For example, 

2 0 the NB-ARC domain of CLAN, as disclosed herein, can be 
replaced by the NB-ARC domain of other CARD 
polypeptides, such as CARD3X, and the like. Another 
example of such a chimera is a polypeptide wherein the 
CARD domain of CLAN is replaced by the CARD domain from 

25 CARD2X or CARD3X, and the like. In a further example, 
an NB-ARC domain can be fused with a caspase catalytic 
P2 0 domain to form a novel chimera with caspase 
activity. One of skill in the art will appreciate that 
a large number of chimeric polypeptides are readily 

30 available by combining domains of two or more CARD- 
containing polypeptides of the invention. Further, 
chimeric polypeptides can contain a functional fragment 
of a CARD- containing polypeptide of the invention fused 
with a domain of a protein known in the art, such as 

35 CED-4, Apaf-1, caspase- 1, and the like. 
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In another embodiment of the invention, 
polypeptides are provided comprising 10 or more 
contiguous amino acids selected from the gx~oup 
consisting of SEQ ID NOS:12, 188, 97, 99, 101, 103, 86 
5 and 90. 

As used herein, the term "polypeptide" when 
used in reference to a CARD-containing polypeptide or 
fragment is intended to refer to a peptide or 
polypeptide of two or more amino acids. The term 

10 "polypeptide analog" includes any polypeptide having an 
amino acid residue sequence substantially the same as a 
sequence specifically described herein in which one or 
more residues have been conservatively substituted with 
a functionally similar residue and which displays the 

15 ability to functionally mimic a CARD-containing 

polypeptide as described herein. A n modif ication" of 
an invention polypeptide also encompasses conservative 
substitutions of an invention polypeptide amino acid 
sequence. Conservative substitutions of encoded amino 

20 acids include, for example, amino acids that belong 

within the following groups: (1) non-polar amino acids 
(Gly, Ala, Val, Leu, and He); (2) polar neutral amino 
acids (Cys, Met, Ser, Thr, Asn, and Gin) ; (3) polar 
acidic amino acids (Asp and Glu) ; (4) polar basic amino 

25 acids (Lys, Arg and His) ; and (5) aromatic amino acids 
(Phe, Trp, Tyr, and His) . Other groupings of amino 
acids can be found, for example in Taylor, J. Theor. 
Biol. 119:205-218 (1986), which is incorporated herein 
by reference. Other minor modifications are included 

3 0 within invention polypeptides so long as the 

polypeptide retains some or all of its function as 
described herein. 

The amino acid length of functional fragments 
or polypeptide analogs of the present invention can 
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range from about 5 amino acids up to the full-length 
protein sequence of an invention CARD- containing 
polypeptide. In certain embodiments, the amino acid 
lengths include, for example, at least about 10 amino 
5 acids, at least about 15, at least about 20, at least 
about 25, at least about 30, at least about 35, at 
least about 40, at least about 45, at least about 50, 
at least about 55, at least about 60, at least about 
65, at least about 70, at least about 75, at least 

10 about 80, at least about 85, at least about 90, at 

least about 95, at least about 100, at least about 12 5, 
at least about 150, at least about 175, at least about 
2 00, at least about 2 50 or more amino acids in length 
up to the full-length CARD- containing polypeptide 

15 sequence. The functional fragments can be contiguous 
amino acid sequences of an invention polypeptide, 
including contiguous amino acid sequences of SEQ ID 
NOS: 12, 188, 97, 99, 101, 103, 86 and 90. A peptide 
of at least about 10 amino acids can be used, for 

2 0 example, as an immungen to raise antibodies specific 

for an invention CARD- containing polypeptide. 

A modification of a polypeptide can also 
include derivatives, analogues and functional mimetics 
thereof, provided that such polypeptide displays a 
25 CARD- containing polypeptide biological activity. For 
example, derivatives can include chemical modifications 
of the polypeptide such as alkylation, acylation, 
carbamylation, iodination, or any modification that 
derivatizes the polypeptide. Such derivatized 

3 0 molecules include, for example, those molecules in 

which free amino groups have been derivatized to form 
amine hydrochlorides, p-toluene sulfonyl groups, 
carbobenzoxy groups, t-butyloxycarbonyl groups, 
chloroacetyl groups or formyl groups. Free carboxyl 
35 groups can be derivatized to form salts, methyl and 
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ethyl esters or other types of esters or hydrazides. 
Free hydroxyl groups can be derivatized to form O-acyi 
or 0-alkyl derivatives. The imidazole nitrogen of 
histidine can be derivatized to form 
5 N-im-benzylhistidine. Also included as derivatives or 
analogues are those peptides which contain one or more 
naturally occurring amino acid derivatives of the 
twenty standard amino acids, for example, 
4-hydroxyproline, 5 -hydroxylysine , 3 -methylhistidine , 
homoserine, ornithine or carboxyglutamate , and can 
include amino acids that are not linked by peptide 
bonds. Polypeptides of the present invention also 
include any polypeptide having one or more additions 
and/or deletions of residues, relative to the sequence 
15 of a polypeptide whose sequence is shown herein, so 
long as CARD- containing polypeptide activity is 
maintained. 



10 



A modification of an invention polypeptide 
includes functional mimetics thereof. Mimetics 

2 0 encompass chemicals containing chemical moieties that 
mimic the function of the polypeptide. For example, if 
a polypeptide contains two charged chemical moieties 
having functional activity, a mimetic places two 
charged chemical moieties in a spatial orientation and 

2 5 constrained structure so that the charged chemical 
function is maintained in three-dimensional space. 
Thus, a mimetic, which orients functional groups that 
provide a function of a CARD -containing polypeptide, 
are included within the meaning of a CARD- containing 

30 polypeptide derivative. All of these modifications are 
included within the term "polypeptide" so long as the 
invention polypeptide or functional fragment retains 
its function. Exemplary mimetics are peptidomimetics , 
peptoids, or other peptide-like polymers such as 

35 poly(b-amino acids), and also non-polymeric compounds 
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upon which functional groups that mimic a peptide are 
positioned. 



Another embodiment of the invention provides 
a CARD- containing polypeptide, or a functional fragment 
5 thereof, fused with a moiety to form a conjugate. As 
used herein, a "moiety" can be a physical, chemical or 
biological entity which contributes functionality to a 
CARD -containing polypeptide or a functional fragment 
thereof. Functionalities contributed by a moiety 

10 include therapeutic or other biological activity, or 

the ability to facilitate identification or recovery of 
a CARD -containing polypeptide. Therefore, a moiety 
will include molecules known in the art to be useful 
for detection of the conjugate by, for example, by 

15 fluorescence, magnetic imaging, detection of 

radioactive emission. A moiety may also be useful for 
recovery of the conjugate, for example a His tag or 
other known tags used for protein isolation and/or 
purification, or a physical substance such as a bead. 

2 0 A moiety can be a therapeutic compound, for example, a 
cytotoxic drug which can be useful to effect a . 
biological change in cells to which the conjugate 
localizes . 



An example of the means for preparing the 
25 invention polypeptide (s) is to express nucleic acids 
encoding a CARD- containing polypeptide in a suitable 
host cell, such as a bacterial cell, a yeast cell, an 
amphibian cell such as an oocyte, or a mammalian cell, 
using methods well known in the art, and recovering the 
30 expressed polypeptide, again using well-known 

purification methods. Invention polypeptides can be 
isolated directly from cells that have been transformed 
with expression vectors as known in the art. 
Recombinantly expressed polypeptides of the invention 
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can also be expressed as fusion proteins with 
appropriate affinity tags, such as glutathione S 
transferase (GST) or poly His, and affinity purified. 
The invention polypeptide, biologically functional 
5 fragments, and functional equivalents thereof can also 
be produced by in vitro transcription/translation 
methods known in the art, such as using reticulocyte 
lysates, as used for example, in the TNT system 
(Promega) . The invention polypeptide, biologically 

10 functional fragments, and functional equivalents 

thereof can also be produced by chemical synthesis. 
For example, synthetic polypeptides can be produced 
using Applied Biosystems, Inc. Model 430A or 431A 
automatic peptide synthesizer (Foster City, CA) 

15 employing the chemistry provided by the manufacturer. 



In accordance with another embodiment of the 
invention, there are provided isolated nucleic acids 
encoding a CARD- containing polypeptide or functional 
fragment thereof. The isolated nucleic acids can be 
20 selected from: 

(a) DNA encoding a polypeptide 
containing the amino acid sequence set forth 
in SEQ ID NOs : 12, 168, 188, 170, 172, 174, 
176, 97, 99, 101, 103, 178, 180, 182, 184, 86 

2 5 and 90, or 

(b) DNA that hybridizes to the DNA of 
(a) under moderately stringent conditions, 
where the DNA encodes biologically active 
CARD- containing polypeptide, or 

30 (c) DNA degenerate with respect to (b) , 

where the DNA encodes biologically active 
CARD- containing polypeptide. 



The nucleic acid molecules described herein 
are useful for producing invention polypeptides, when 
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such nucleic acids are incorporated into a variety of 
protein expression systems known to those of skill in 
the art. In addition, such nucleic acid molecules or 
fragments thereof can be labeled with a readily 
5 detectable substituent and used as hybridization probes 
for assaying for the presence and/or amount of an 
invention CARD-encoding gene or mRNA transcript in a 
given sample. The nucleic acid molecules described 
herein, and fragments thereof, are also useful as 
10 primers and/or templates in a PCR reaction for 
amplifying genes encoding invention polypeptides 
described herein. 



The term "nucleic acid" (also referred to as 
polynucleotides) encompasses ribonucleic acid (RNA) or 

15 deoxyribonucleic acid (DNA) , probes, oligonucleotides, 
and primers and can be single stranded or double 
stranded. DNA can be either complementary DNA (cDNA) 
or genomic DNA, e.g. a CARD-encoding gene, and can 
represent the sense strand, the anti-sense strand, or 

2 0 both. Examples of nucleic acids are RNA, cDNA, or 
isolated genomic DNA encoding a CARD-containing 
polypeptide. One means of isolating a CARD-encoding 
nucleic acid is to probe a mammalian genomic or cDNA 
library with a natural or artificially designed DNA 

25 probe using methods well known in the art. DNA probes 
derived from the CARD-encoding gene are particularly 
useful for this purpose. DNA and cDNA molecules that 
encode CARD- containing polypeptides can be used to 
obtain complementary genomic DNA, cDNA or RNA from 

30 mammalian (e.g., human, mouse, rat, rabbit, pig, and 
the like) , or other animal sources, or to isolate 
related cDNA or genomic clones by screening cDNA or 
genomic libraries, using methods described in more 
detail below. Such nucleic acids include, but are not 

35 limited to, nucleic acids comprising substantially the 
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same nucleotide sequence as set forth in SEQ ID NOS : 
11, 167, 187, 169, 171, 173, 175, 96, 98, 100, 102, 
177, 179, 181, 183, 85 and 89. In general, a genomic 
sequence of the invention includes regulatory regions 
5 such as promoters, enhancers, and introns that are 
outside of the exons encoding a CARD- containing 
polypeptide but does not include proximal genes that do 
not encode a CARD -containing polypeptide. 

Thus a CARD-encoding nucleic acid as used 
10 herein refers to a nucleic acid encoding a CARD- 
containing polypeptide of the invention, or a 
functional fragment thereof. 



Use of the terms "isolated" and/or "purified" 
and/or "substantially purified" in the present 

15 specification and claims as a modifier of DNA, RNA, 
polypeptides or proteins means that the DNA, RNA, 
polypeptides or proteins so designated have been 
produced in such form by the hand of man, and thus are 
separated from their native in vivo cellular 

20 environment, and are substantially free of any other 
species of nucleic acid or protein. As a result of 
this human intervention, the recombinant DNAs, RNAs , 
polypeptides and proteins of the invention are useful 
in ways described herein that the DNAs, RNAs, 

25 polypeptides or proteins as they naturally occur are 
not . 



Invention nucleic acids encoding CARD- 
containing polypeptides and invention CARD- containing 
polypeptides can be obtained from any species of 
3 0 organism, such as prokaryotes, eukaryotes, plants, 
fungi, vertebrates, invertebrates, and the like. A 
particular species can be mammalian, As used herein, 
"mammalian" refers to a subset of species from which an 
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invention CARD-encoding nucleic acid is derived, e.g., 
human, rat, mouse, rabbit, monkey, baboon, bovine, 
porcine, ovine, canine, feline, and the like. A 
preferred CARD-encoding nucleic acid herein, is human 
5 CARD-encoding nucleic acid. 



In one embodiment of the present invention, 
cDNAs encoding the invention CARD- containing 
polypeptides disclosed herein comprise substantially 
the same nucleotide sequence as the coding region set 
10 forth in any of SEQ ID NOS : 11, 167, 187, 169, 171, 

173, 175, 96, 98, 100, 102, 177, 179, 181, 183, 85 and 
89. 



As employed herein, the term "substantially 
the same nucleotide sequence" refers to a nucleic acid 

15 molecule (DNA or RNA) having sufficient identity to the 
reference polynucleotide, such that it will hybridize 
to the reference nucleotide under moderately or highly 
stringent hybridization conditions. In one embodiment, 
a nucleic acid molecule having substantially the same 

2 0 nucleotide sequence as the reference nucleotide 

sequence encodes substantially the same amino acid 
sequence as that set forth in any of SEQ ID NOS: 12, 
168, 188, 170, 172, 174, 176, 97, 99, 101, 103, 178, 
180, 182, 184, 86 and 90. In another embodiment, a 

25 nucleic acid molecule having "substantially the same 
nucleotide sequence" as the reference nucleotide 
sequence has at least 60%, or at least 65% identity 
with respect to the reference nucleotide sequence, such 
as at least 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 

30 86%, 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% 
or 99% identity to the reference nucleotide sequence. 
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In accordance with the invention; 
specifically included within the definition of 
substantially the same nucleotide sequence is the 
predominant nucleotide sequence of a particular 
5 invention CARD- containing polypeptide described herein. 
The predominant nucleotide sequence refers to the most 
commonly present naturally occurring nucleotide 
sequence in a species population. A predominant CARD- 
encoding nucleic acid of the invention refers to a 
10 nucleotide sequence having sequence identity to a 

nucleotide sequence disclosed herein that is greater 
than that of any other naturally occurring nucleotide 
sequence of a particular species (e.g., human). 

In one embodiment, a nucleic acid molecule 
that has substantially the same nucleotide sequence as 
a reference sequence is a modification of the reference 
sequence. As used herein, a "modification" of a 
nucleic acid can include one or several nucleotide 
additions, deletions, or substitutions with respect to 
a reference sequence. A modification of a nucleic acid 
can include substitutions that do not change the 
encoded amino acid sequence due to the degeneracy of 
the genetic code. Such modifications can correspond to 
variations that are made deliberately, or which occur 
as mutations during nucleic acid replication. 

Exemplary modifications of the recited 
nucleotide sequences include sequences that correspond 
to homologs of other species, including mammalian 
species such as mouse, primates, including monkey and 
3 0 baboon, rat, rabbit, bovine, porcine, ovine, canine, 
feline, or other animal species. The corresponding 
nucleotide sequences of non-human species can be 
determined by methods known in the art, such as by PCR 
or by screening genomic, cDNA or expression libraries. 



20 
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Another exemplary modification of the 
invention CARD -encoding nucleic acid or CARD- containing 
polypeptide can correspond to splice variant forms of 
the CARD-encoding nucleotide sequence. Additionally, a 
5 modification of a nucleotide sequence can include one 
or more non-native nucleotides, having, for example, 
modifications to the base, the sugar, or the phosphate 
portion, or having a modified phosphodiester linkage. 
Such modifications can be advantageous in increasing 
10 the stability of the nucleic acid molecule. 

Furthermore, a modification of a nucleotide 
sequence can include, for example, a detectable moiety, 
such as a radiolabel, a f luorochrome , a ferromagnetic 
substance, a luminescent tag or a detectable binding 
15 agent such as biotin. Such modifications can be 

advantageous in applications where detection of a CARD- 
encoding nucleic acid molecule is desired. 



In another embodiment, a nucleic acid 
molecule that has substantially the same nucleotide 
2 0 sequence as a reference sequence is a functionally 
equivalent nucleic acid, which indicates that it is 
phenotypically similar to the reference nucleic acid. 
As used herein, the phrase "functionally equivalent 
nucleic acids" encompasses nucleic acids characterized 

2 5 by slight and non-consequential sequence variations 

that will function in substantially the same manner to 
produce the same polypeptide product (s) as the nucleic 
acids disclosed herein. In particular, functionally 
equivalent nucleic acids encode polypeptides that are 

3 0 the same as those encoded by the nucleic acids 

disclosed herein or that have conservative amino acid 
variations, as described above. These variations 
include those recognized by skilled artisans as those 
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that do not substantially alter the tertiary structure 
of the protein. 

Further provided are nucleic acids encoding 
CARD- containing polypeptides that, by virtue of the 
5 degeneracy of the genetic code, do not necessarily 
hybridize to the invention nucleic acids under 
specified hybridization conditions. Preferred nucleic 
acids encoding the invention CARD- containing 
polypeptides are comprised of nucleotides that encode 
10 substantially the same amino acid sequence as set forth 
in SEQ ID N0S:12, 168, 188, 170, 172, 174, 176, 97, 99, 
101, 103, 178, 180, 182, 184, 86 and 90. 



Hybridization refers to the binding of 
complementary strands of nucleic acid (i.e., 
15 sense : ant isense strands or probe : target -DNA) to each 

other through hydrogen bonds, similar to the bonds that 
naturally occur in chromosomal DNA . Stringency levels 
used to hybridize a given probe with target -DNA can be 
readily varied by those of skill in the art. 



2 0 The phrase "stringent hybridization" is used 

herein to refer to conditions under which polynucleic 
acid hybrids are stable. As known to those of skill in 
the art, the stability of hybrids is reflected in the 
melting temperature (Tm) of the hybrids. In general, 

25 the stability of a hybrid is a function of sodium ion 
concentration and temperature. Typically, the 
hybridization reaction is performed under conditions of 
lower stringency, followed by washes of varying, but 
higher, stringency. Reference to hybridization 

30 stringency relates to such washing conditions. 



As used herein, the phrase "moderately 
stringent hybridization" refers to conditions that 
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permit target-nucleic acid to bind a complementary 
nucleic acid. The hybridized nucleic acids will 
generally have at least about 60% identity, at least 
about 75% identity, such as at least about 85% 
5 identity; or at least about 90% identity. Moderately 
stringent conditions are conditions equivalent to 
hybridization in 50% formamide, 5X Denhart r s solution, 
5X SSPE, 0.2% SDS at 42°C, followed by washing in 0 . 2X 
SSPE, 0.2% SDS, at 42°C. 



10 The phrase "high stringency hybridization" 

refers to conditions that permit hybridization of only 
those nucleic acid sequences that form stable hybrids 
in 0.018M NaCl at 65°C, for example, if a hybrid is not 
stable in 0.018M NaCl at 65°C, it will not be stable 

15 under high stringency conditions, as contemplated 

herein. High stringency conditions can be provided, 
for example, by hybridization in 50% formamide, 5X 
Denhart ! s solution, 5X SSPE, 0.2% SDS at 42 °C, followed 
by washing in 0 . IX SSPE, and 0.1% SDS at 65°C. 

2 0 The phrase "low stringency hybridization" 

refers to conditions equivalent to hybridization in 10% 
formamide, 5X Denhart ' s solution, 6X SSPE, 0.2% SDS- at 
22°C, followed by washing in IX SSPE, 0.2% SDS, at 
3 7°C. Denhart 's solution contains 1% Ficoll, 1% 
25 polyvinylpyrolidone, and 1% bovine serum albumin (BSA) . 
20X SSPE (sodium chloride, sodium phosphate, ethylene 
diamide tetraacetic acid (EDTA) ) contains 3M sodium 
chloride, 0.2M sodium phosphate, and 0.025 M (EDTA). 
Other suitable moderate stringency and high stringency 

3 0 hybridization buffers and conditions are well known to 

those of skill in the art and are described, for 
example, in Sambrook et al . , supra (1989); and Ausubel 
et al . , supra . 2000). Nucleic acids encoding 
polypeptides hybridize under moderately stringent or 
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high stringency conditions to substantially the entire 
sequence, or substantial portions, for example, 
typically at least 15-30 nucleotides of the nucleic 
acid sequence set forth in SEQ ID NOS-.ll, 167, 187, 
5 169, 171, 173, 175, 96, 98, 100, 102, 177, 179, 181, 
183, 85 and 89. 



As used herein, the term "degenerate" refers 
to codons that differ in at least one nucleotide from a 
reference nucleic acid, e.g., SEQ ID NOSill, 167, 187, 

10 169, 171, 173, 175, 96, 98, 100, 102, 177, 179, 181, 

183, 8 5 and 89, but encode the same amino acids as the 
reference nucleic acid. For example, codons specified 
by the triplets "UCU" , "UCC", "UCA" , and "UCG" are 
degenerate with respect to each other since all four of 

15 these codons encode the amino acid serine. 



The invention also provides a modification of 
a nucleotide sequence that hybridizes to a CARD- 
encoding nucleic acid molecule, for example, a nucleic 
acid molecule referenced as any of SEQ ID NOS:ll, 167, 

20 187, 169, 171, 173, 175, 96, 98, 100, 102, 177, 179/ 
181, 183, 85 and 89 under moderately stringent 
conditions. Modifications of nucleotide sequences, 
where the modification has at least 60% identity to a 
CARD-encoding nucleotide sequence, are also provided. 

25 The invention also provides modification of a CARD- 
encoding nucleotide sequence having at least 65% 
identity, at least 70% identity, at least 72% identity, 
at least 74% identity, at least 76% identity, at least 
78% identity, at least 80% identity, at least 82% 

30 identity, at least 84% identity, at least 86% identity, 
at least 88% identity, at least 90% identity, at least 
91% identity, at least 92% identity, at least 93% 
identity, at least 94% identity, at least 95% identity, 



WO 01/90156 PCT/US01/17158 

39 

at least 96% identity, at least 97% identity, at least 
98% identity or at least 99% identity. 

Identity of any two nucleic acid or amino 
acid sequences can be determined by those skilled in 
5 the art based, for example, on a BLAST 2.0 computer 
alignment, using default parameters. BLAST 2.0 
searching is known in the art and is publicly 
available, for example, at 

http://www.ncbi.nlm.nih.gov/BLAST/, as described by 
10 Tatiana et al . , FEMS Microbiol Lett. 174:247-250 

(1999); Altschul et al . , Nucleic Acids Res. , 25:3389- 
3402 (1997) . 

One means of isolating a nucleic acid 
encoding a CARD- containing polypeptide is to probe a 
15 cDNA library or genomic library with a natural or 

artificially designed nucleic acid probe using methods 
well known in the art. Nucleic acid probes derived 
from a CARD-encoding gene are particularly useful for 
this purpose. DNA and cDNA molecules that encode CARD- 

2 0 containing polypeptides can be used to obtain 

complementary genomic DNA, cDNA or RNA from mammals, 
for example, human, mouse, rat, rabbit, pig, and the 
like, or other animal sources, or to isolate related 
cDNA or genomic clones by the screening of cDNA or 
25 genomic libraries, by methods well known in the art 

(see, for example, the Examples set forth hereinafter; 
and Sambrook et al . , supra , 1989; Ausubel et al., 
supra , 2000) . 

Another useful method for producing a CARD- 

3 0 encoding nucleic acid molecule of the invention 

involves amplification of the nucleic acid molecule 
using PCR and invention oligonucleotides and, 
optionally, purification of the resulting product by 
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gel electrophoresis. Either PCR or RT-PCR can be used 
to produce a CARD-encoding nucleic acid molecule having 
any desired nucleotide boundaries as described in the 
Examples. Desired modifications to the nucleic acid 
5 sequence can also be introduced by choosing an 

appropriate oligonucleotide primer with one or more 
additions, deletions or substitutions. Such nucleic 
acid molecules can be amplified exponentially starting 
from as little as a single gene or mRNA copy, from any 
10 cell, tissue or species of interest. 

The invention additionally provides a nucleic 
acid that hybridizes under high stringency conditions 
to the CARD coding portion of any of SEQ ID NOSrll, 
187, 96, 98, 100, 102, 85 and 89, such as to any of SEQ 
15 ID NOS: 168, 170, 172 and 178. The invention also 
provides a nucleic acid having a nucleotide sequence 
substantially the same as set that forth in any of SEQ 
ID 11, 167, 187, 169, 171, 173, 175, 96, 98, 100, 102, 
177, 179, 181, 183, 85 and 89. 

The invention also provides a method for 
identifying nucleic acids encoding a mammalian CARD- 
containing polypeptide by contacting a sample 
containing nucleic acids with one or more invention 
nucleic acid molecules or oligonucleotides, wherein the 
contacting is effected under high stringency 
hybridization conditions, and identifying a nucleic 
acid that hybridizes to the oligonucleotide. The 
invention additionally provides a method of detecting a 
CARD-encoding nucleic acid molecule in a sample by 
contacting the sample with two or more invention 
oligonucleotides, amplifying a nucleic acid molecule, 
and detecting the amplification. The amplification can 
be performed, for example, using PCR. The invention 
further provides oligonucleotides that function as 
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single stranded nucleic acid primers for amplification 
of a CARD-encoding nucleic acid, wherein the primers 
comprise a nucleic acid sequence derived from the 
nucleic acid sequences set forth as SEQ ID NOSrll, 187 , 
5 96, 98, 100, 102, 85 and 89. 

In accordance with a further embodiment of 
the present invention, optionally labeled CARD-encoding 
cDNAs, or fragments thereof, can be employed to probe 
library(ies) such as cDNA, genomic, BAC, and the like 

10 for predominant nucleic acid sequences or additional 
nucleic acid sequences encoding novel CARD-containing 
polypeptides. Construction and screening of suitable 
mammalian cDNA libraries, including human cDNA 
libraries, is well-known in the art, as demonstrated, 

15 for example, in Ausubel et al . , supra . Screening of 
such a cDNA library is initially carried out under 
low- stringency conditions, which comprise a temperature 
of less than about 42 °C, a formamide concentration of 
less than about 50%, and a moderate to low salt 

2 0 concentration. 

Probe-based screening conditions can comprise 
a temperature of about 3 7°C, a formamide concentration 
of about 2 0%, and a salt concentration of about 5X 
standard saline citrate (SSC; 20X SSC contains 3M 

25 sodium chloride, 0.3M sodium citrate, pH 7.0). Such 
conditions will allow the identification of sequences 
which have a substantial degree of similarity with the 
probe sequence, without requiring perfect homology. 
The phrase "substantial similarity" refers to sequences 

30 which share at least 50% homology. Hybridization 

conditions are selected which allow the identification 
of sequences having at least 70% homology with the 
probe, while discriminating against sequences which 
have a lower degree of homology with the probe. As a 



I 



WO 01/90156 PCT/US01/17158 

42 

result, nucleic acids having substantially the same 
nucleotide sequence as any of SEQ ID NOS:ll, 167, 187, 
169, 171, 173, 175, 96, 98, 100, 102, 177, 179, 181, 
183, 85 and 89 are obtained. 



5 As used herein, a nucleic acid "probe" is 

single-stranded nucleic acid, or analog thereof, that 
has a sequence of nucleotides that includes at least 
15, at least 20, at least 50, at least 100, at least 
200, at least 300, at least 400, or at least 500 

10 contiguous bases that are substantially the same as, or 
the complement of, any contiguous bases set forth in 
any of SEQ ID NOS:ll, 187, 96, 98, 100, 102, 85 and 89. 
In addition, the entire cDNA encoding region of an 
invention CARD-containing polypeptide, or an entire 

15 sequence substantially the same as SEQ ID N0S:11, 187, 
96, 98, 100, 102, 85 and 89 can be used as a probe. 
Probes can be labeled by methods well-known in the art, 
as described hereinafter, and used in various 
diagnostic kits. 

20 The invention additionally provides an 

oligonucleotide comprising between 15 and 3 00 
contiguous nucleotides of any of SEQ ID NOS-.ll, 187, 
96, 98, 100, 102, 85 and 89 or the anti-sense strand 
thereof. As used herein, the term "oligonucleotide" 

25 refers to a nucleic acid molecule that includes at 
least 15 contiguous nucleotides from a reference 
nucleotide sequence, can include at least 16, 17, 18, 
19, 20 or at least 2 5 contiguous nucleotides, and often 
includes at least 30, 40, 50, 60, 70, 80, 90, 100, 125, 

30 150, 175, 200, 225, 250, 275, 300, 325, up to 350 

contiguous nucleotides from the reference nucleotide 
sequence. The reference nucleotide sequence can be the 
sense strand or the anti- sense strand. 
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The oligonucleotides of the invention that 
contain at least 15 contiguous nucleotides of a 
reference CARD- encoding nucleotide sequence are able to 
hybridize to CARD - encoding nucleotide sequences under 
5 moderately stringent hybridization conditions and thus 
can be advantageously used, for example, as probes to 
detect CARD-encoding DNA or RNA in a sample, and to 
detect splice variants thereof; as sequencing or PCR 
primers; as antisense reagents to block transcription 
10 of CARD-encoding RNA in cells; or in other applications 
known to those skilled in the art in which 
hybridization to a CARD-encoding nucleic acid molecule 
is desirable. 

In accordance with another embodiment of the 
15 invention, a method is provided for identifying nucleic 
acids encoding a CARD- containing polypeptide. The 
method comprises contacting a sample containing nucleic 
acids with an invention probe or an invention 
oligonucleotide, wherein said contacting is effected 
20 under high stringency hybridization conditions, and . 
identifying nucleic acids which hybridize thereto. 
Methods for identification of nucleic acids encoding a 
CARD -containing polypeptide are disclosed herein and 
exemplified in the Examples. 

25 Also provided in accordance with present 

invention is a method for identifying a CARD-encoding 
nucleotide sequence comprising the steps of using a 
CARD-encoding nucleotide sequence selected from SEQ ID 
NOS:ll, 167, 187, 169, 171, 173, 175, 96, 98, 100, 102, 

30 177, 179, 181, 183, 85 and 89 to identify a candidate 
CARD-encoding nucleotide sequence and verifying the 
candidate CARD-encoding nucleotide sequence by aligning 
the candidate sequence with known CARD-encoding 
nucleotide sequences, where a conserved CARD domain 
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sequence or a predicted three dimensional polypeptide 
structure similar to a known CARD domain three 
dimensional structure confirms the candidate sequence 
as a CARD-encoding sequence. Methods for identifying 
5 CARD-encoding sequences are provided herein (See 
Examples) . 



It is understood that a CARD-encoding nucleic 
acid molecule of the invention, as used herein, 
specifically excludes previously known nucleic acid 

10 molecules consisting of nucleotide sequences having 
identity with the CARD-encoding nucleotide sequence 
(SEQ ID NOS:ll, 167, 187, 169, 171, 173, 175, 96, 98, 
100, 102, 177, 179, 181, 183, 85 and 89), such as 
Expressed Sequence Tags (ESTs) , Sequence Tagged Sites 

15 (STSs) and genomic fragments, deposited in public 

databases such as the nr, dbest, dbsts, gss and htgs 
databases, which are available for searching at 
http : / /www . ncbi . nlm.nih.gov/blast/ . 

In particular, an invention CARD-encoding 
20 nucleic acid molecule excludes the exact, specific and 
complete nucleic acid molecule sequence corresponding 
to any of the nucleotide sequences having the Genbank 
(gb) , EMBL (emb) or DDBJ (dbj ) accession numbers 
described below . Accession numbers specifically 
25 excluded include 61:6165147 ( Phase- 1) , AC007728 (Phase- 
1), NT-002476 (Phase-1) , AC010968 (Phase-1) , AP001153, 
AC022468 (Phase-1), GI:6253000 (Phase-1), AC0097959 
(Phase-1) , GI: 6497652 (Phase-1) (contig:23086 :40635) , 
GI:6497652 (Phase-1) (contig :41136 : 57024) , AC023068 
30 (Phase-1), W58453, AA257158, AA046000, AW085161, 
AI189838, AA418021, AA046105, W58488, AA418193, 
AA257066, AI217611, AW295205, AI023795, AL389934, 
AA070591, AA070591, AC027011, AP002787, AQ889169, 
AV719179, AI263294, AV656315, AW337918, BF207840, • 
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AW418826, BK903662, AI023795, H25984, AL121653 and 
NT_005194.1. The human contig referenced as GenBank 
accession No. AC007608 is also specifically excluded 
from a CARD encoding nucleic acid molecule. The 
5 genomic contigs referenced as GenBank accession numbers 
GI 5001450, GI 8575872 and GI 9795562 are also 
specifically excluded from invention nucleic acid 
molecules. Since one of skill in the art will realize 
that the above-recited excluded sequences may be 
10 revised at a later date, the skilled artisan will 

recognize that the above-recited sequences are excluded 
as they stand on the priority date of this application. 

The isolated nucleic acid molecules of the 
invention can be used in a variety of diagnostic and 
therapeutic applications. For example, the isolated 
nucleic acid molecules of the invention can be used as 
probes, as described above; as templates for the 
recombinant expression of CARD- containing polypeptides; 
or in screening assays such as two-hybrid assays to 
identify cellular molecules that bind CARD-containing 
polypeptides . 

The invention thus provides methods for 
detecting a CARD-encoding nucleic acid in a sample. 
The methods of detecting a CARD-encoding nucleic acid 
in a sample can be either qualitative or quantitative, 
as desired. For example, the presence, abundance, 
integrity or structure of a CARD-encoding nucleic acid 
can be determined, as desired, depending on the assay 
format and the probe used for hybridization or primer 
pair chosen for application. 

Useful assays for detecting a CARD-containing 
nucleic acid based on specific hybridization with an 
isolated invention oligonucleotide are well known in 
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the art and include, for example, in situ 
hybridization, which can be used to detect altered 
chromosomal location of the nucleic acid molecule, 
altered gene copy number, and RNA abundance, depending 
5 on the assay format used. Other hybridization assays 
include, for example, Northern blots and RNase 
protection assays, which can be used to determine the 
abundance and integrity of different RNA splice 
variants, and Southern blots, which can be used to 

10 determine the copy number and integrity of DNA. A 
hybridization probe can be labeled with any suitable 
detectable moiety, such as a radioisotope, 
f luorochrome, chemiluminescent marker, biotin, or other 
detectable moiety known in the art that is detectable 

15 by analytical methods. 



As used herein, the terms "label" and 
"indicating means" in their various grammatical forms 
refer to single atoms and molecules that are either 
directly or indirectly involved in the production of a 

20 detectable signal. Any label or indicating means can 
be linked to invention nucleic acid probes, expressed 
proteins, polypeptide fragments, or antibody molecules. 
These atoms or molecules can be used alone or in 
conjunction with additional reagents. Such labels are 

2 5 themselves well-known in clinical diagnostic chemistry. 



Useful assays for detecting a CARD-encoding 
nucleic acid in a sample based on amplifying a CARD- 
encoding nucleic acid with two or more invention 
oligonucleotides are also well known in the art, and 
3 0 include, for example, qualitative or quantitative 

polymerase chain reaction (PCR) ; reverse-transcription 
PGR (RT-PCR) ; single strand conformational polymorphism 
(SSCP) analysis, which can readily identify a single 
point mutation in DNA based on differences in the 
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secondary structure of single- strand DNA that produce 
an altered electrophoretic mobility upon non-denaturing 
gel electrophoresis; and coupled PGR, transcription and 
translation assays, such as a protein truncation test, 
5 in which a mutation in DNA is determined by an altered 
protein product on an electrophoresis gel. 
Additionally, the amplified CARD-encoding nucleic acid 
can be sequenced to detect mutations and mutational 
hot-spots, and specific assays for large-scale 
10 screening of samples to identify such mutations can be 
developed. 



Also provided are antisense-nucleic acids 
having a sequence capable of binding specifically with 
full-length or any portion of an mRNA that encodes 

15 CARD-containing polypeptides so as to prevent 

translation of the mRNA. The antisense-nucleic acid 
can have a sequence capable of binding specifically 
with any portion of the sequence of the cDNA encoding 
CARD-containing polypeptides. As used herein, the 

20 phrase "binding specif ically ,f encompasses the ability 
of a nucleic acid sequence to recognize a complementary 
nucleic acid sequence and to form double-helical 
segments therewith via the formation of hydrogen bonds 
between the complementary base pairs. An example of an 

25 antisense-nucleic acid is an antisense-nucleic acid 
comprising chemical analogs of nucleotides. 

The present invention provides means to alter 
levels of expression of CARD-containing polypeptides by 
recombinant ly expressing CARD-containing ant i -sense 
30 nucleic acids or employing synthetic ant i- sense nucleic 
acid compositions (hereinafter SANC) that inhibit 
translation of mRNA encoding these polypeptides. 
Synthetic oligonucleotides, or other antisense-nucleic 
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acid chemical structures designed to recognize and 
selectively bind to mRNA are constructed to be 
complementary to full-length or portions of a CARD- 
encoding strand, including nucleotide sequences 
5 substantially the same as SEQ ID N0S:11, 187, 96, 98, 
100, 102, 85 and 89. 



The SANC is designed to be stable in the 
blood stream for administration to a subject by 
injection, or in laboratory cell culture conditions. 

10 The SANC is designed to be capable of passing through 
the cell membrane in order to enter the cytoplasm of 
the cell by virtue of physical and chemical properties 
of the SANC, which render it capable of passing through 
cell membranes, for example, by designing small, 

15 hydrophobic SANC chemical structures, or by virtue of 
specific transport systems in the cell which recognize 
and transport the SANC into the cell. In addition, the 
SANC can be designed for administration only to certain 
selected cell populations by targeting the SANC to be 

20 recognized by specific cellular uptake mechanisms which 
bind and take up the SANC only within select cell 
populations. In a particular embodiment the SANC is an 
antisense oligonucleotide. 

For example, the SANC may be designed to bind 
25 to a receptor found only in a certain cell type, as 
discussed above. The SANC is also designed to 
recognize and selectively bind to target mRNA sequence, 
which can correspond to a sequence contained within the 
sequences shown in SEQ ID NOS: 11, 187, 96, 98, 100, 
30 102, 85 and 89. The SANC is designed to inactivate 
target mRNA sequence by either binding thereto and 
inducing degradation of the mRNA by, for example, RNase 
I digestion, or inhibiting translation of mRNA target 
sequence by interfering with the binding of 
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translation-regulating factors or ribosomes, or 
inclusion of other chemical structures, such as 
ribozyme sequences or reactive chemical groups which 
either degrade or chemically modify the target mRNA. 
5 SANCs have been shown to be capable of such properties 
when directed against mRNA targets (see Cohen et al., 
TIPS , 10:435 (1989) and Weintraub, Sci . American . 
January (1990) , pp.40) . 

The invention further provides a method of 
10 altering the level of a biochemical process modulated 
by a CARD- containing polypeptide by introducing an 
antisense nucleotide sequence into the cell, wherein 
the antisense nucleotide sequence specifically 
hybridizes to a CARD-encoding nucleic acid molecule, 
15 wherein the hybridization reduces or inhibits the 

expression of the CARD- containing polypeptide in the 
cell. The use of anti -sense nucleic acids, including 
recombinant anti -sense nucleic acids or SANCs, can be 
advantageously used to inhibit cell death. 

2 0 Compositions comprising an amount of the 

antisense-nucleic acid of the invention, effective to 
reduce expression of CARD- containing polypeptides by 
entering a cell and binding specifically to CARD- 
encoding mRNA so as to prevent translation and an 
25 acceptable hydrophobic carrier capable of passing 
through a cell membrane are also provided herein. 
Suitable hydrophobic carriers are described, for 
example, in U.S. Patent Nos . 5,334,761; 4,88 9,953; 
4,897,355, and the like. The acceptable hydrophobic 

3 0 carrier capable of passing through cell membranes may 

also comprise a structure which binds to a receptor 
specific for a selected cell type and is thereby taken 
up by cells of the selected cell type. For example, 
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the structure can be part of a protein known to bind to 
a cell -type specific receptor such as a tumor. 

Ant i sense -nucleic acid compositions are 
useful to inhibit translation of mRNA encoding 
5 invention polypeptides. Synthetic oligonucleotides, or 
other antisense chemical structures are designed to 
bind to CARD-encoding mRNA and inhibit translation of 
mRNA and are useful as compositions to inhibit 
expression of CARD-encoding genes or CARD- associated 
10 polypeptide genes in a tissue sample or in a subject. 



The invention also provides vectors 
containing the CARD-encoding nucleic acids of the 
invention. Suitable expression vectors are well-known 
in the art and include vectors capable of expressing 

15 nucleic acid operatively linked to a regulatory 
sequence or element such as a promoter region or 
enhancer region that is capable of regulating 
expression of such nucleic acid. Appropriate 
expression vectors include those that are replicable in 

20 eukaryotic cells and/or prokaryotic cells and those 

that remain episomal or those which integrate into the 
host cell genome. 

Promoters or enhancers, depending upon the 
nature of the regulation, can be constitutive or 

2 5 regulated. The regulatory sequences or regulatory 
elements are operatively linked to a nucleic acid of 
the invention such that the physical aind functional 
relationship between the nucleic acid and the 
regulatory sequence allows transcription of the nucleic 

30 acid. 



Suitable vectors for expression in 
prokaryotic or eukaryotic cells are well known to those 
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skilled in the art (see, for example, Ausubel et al . , 
supra, 2000) . Vectors useful for expression in 
eukaryotic cells can include, for example, regulatory 
elements including the SV4 0 early promoter, the 
5 cytomegalovirus (CMV) promoter, the mouse mammary tumor 
virus (MMTV) steroid-inducible promoter, Moloney 
murine leukemia virus (MMLV) promoter, and the like. 
The vectors of the invention are useful for subcloning 
and amplifying a CARD-encoding nucleic acid molecule 

10 and for recombinantly expressing a CARD -containing 
polypeptide. A vector of the invention can include, 
for example, viral vectors such as a bacteriophage, a 
baculovirus or a retrovirus; cosmids or plasmids; and, 
particularly for cloning large nucleic acid molecules, 

15 bacterial artificial chromosome vectors (BACs) and 
yeast artificial chromosome vectors (YACs) . Such 
vectors are commercially available, and their uses are 
well known in the art. One skilled in the art will 
know or can readily determine an appropriate promoter 

20 for expression in a particular host cell. 

The invention additionally provides 
recombinant cells containing CARD-encoding nucleic 
acids of the invention. The recombinant cells are 
generated by introducing into a host cell a vector 

2 5 containing a CARD-encoding nucleic acid molecule. The 

recombinant cells are transducted, transfected or 
otherwise genetically modified. Exemplary host cells 
that can be used to express recombinant CARD molecules 
include mammalian primary cells; established mammalian 

3 0 cell lines, such as COS, CHO, HeLa, NIH3T3, HEK 293 and 

PC12 cells; amphibian cells, such as Xenopus embryos 
and oocytes and other vertebrate cells. Exemplary host 
cells also include insect cells such as Drosophila, 
yeast cells such as Saccharomyces cerevisiae, 
35 Saccharomyces poxnbe, or Pichia pas tori s, and 
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prokaryotic cells such as Escherichia coli. Additional 
host cells can be obtained, for example, from ATCC 
(Manassas , VA) . 



In one embodiment, CARD-encoding nucleic 
5 acids can be delivered into mammalian cells, either in 
■vivo or in vitro using suitable vectors well-known in 
the art. Suitable vectors for delivering a CARD- 
containing polypeptide, or a functional fragment 
thereof to a mammalian cell, include viral vectors such 

10 as retroviral vectors, adenovirus, adeno-associated 
virus, lentivirus, herpesvirus, as well as non-viral 
vectors such as plasmid vectors. Such vectors are 
useful for providing therapeutic amounts of a CARD- 
containing polypeptide (see, for example, U.S. Patent 

15 No. 5,399,346, issued March 21, 1995). Delivery of 

CARD polypeptides or nucleic acids therapeutically can 
be particularly useful when targeted to a tumor cell, 
thereby inducing apoptosis in tumor cells. In 
addition, where it is desirable to limit or reduce the 

20 in vivo expression of a CARD- containing polypeptide, 
the introduction of the antisense strand of the 
invention nucleic acid is contemplated. 

Viral based systems provide the advantage of 
being able to introduce relatively high levels of the 

25 heterologous nucleic acid into a variety of cells. 
Suitable viral vectors for introducing an invention 
CARD-encoding nucleic acid into mammalian cells are 
well known in the art. These viral vectors include, 
for example, Herpes simplex virus vectors (Geller et 

30 al., Science . 241:1667-1669 (1988)); vaccinia virus 

vectors (Piccini et al . , Meth. Enzvmology , 153:545-563 
(1987)); cytomegalovirus vectors (Mocarski et al . , in 
Viral Vectors . Y. Gluzman and S.H. Hughes, Eds., Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 
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1988, pp. 78-84)); Moloney murine leukemia virus 
vectors (Danos et al . , Proc. Natl. Acad. Sci. USA , 
85:6460-6464 (1988); Blaese et al . , Science , 270:475- 
479 (1995)/ Onodera et al . , J. Virol. , 72:1769-1774 
5 (1998) ) ; adenovirus vectors (Berkner, Biotechniques , 

6:616-626 (1988); Cotten et al Proc. Natl. Acad. Sci. 
USA, 89:6094-6098 (1992); Graham et al . , Meth. Mol . 
Biol . , 7:109-127 (1991); Li et al . , Human Gene Therapy , 
4:403-409 (1993); Zabner et al . , Nature Genetics , 6:75- 

10 83 (1994)); adeno-associated virus vectors (Goldman et 
al., Human Gene Therapy , 10:2261-2268 (1997); Greelish 
et al., Nature Med . , 5:439-443 (1999); Wang et al . , 
Proc. Natl. Acad. Sci. USA , 96:3906-3910 (1999); Snyder 
et al., Nature Med . , 5:64-70 (1999); Herzog et al . , 

15 Nature Med. , 5:56-63 (1999)); retrovirus vectors 
(Donahue et al . , Nature Med. , 4:181-186 (1998); 
Shackleford et al . , Proc. Natl. Acad. Sci. USA , 
85:9655-9659 (1988); U.S. Patent Nos . 4,405,712, 
4,650,764 and 5,252,479, and WIPO publications WO 

20 92/07573, WO 90/06997, WO 89/05345, WO 92/05266 and WO 
92/14829; and lentivirus vectors (Kafri et al . , Nature 
Genetics , 17:314-317 (1997)). 

For example, in one embodiment of the present 
invention, adenovirus -transf errin/polylysine-DNA 

25 (TfAdpl-DNA) vector complexes (Wagner et al . , Proc . 

Natl. Acad. Sci., USA . 89:6099-6103 (1992); Curiel et 
al., Hum. Gene Ther. , 3:147-154 (1992); Gao et al . , 
Hum. Ge ne Ther. . 4:14-24 (1993)) are employed to 
transduce mammalian cells with heterologous CARD- 

30 encoding nucleic acid. Any of the plasmid expression 
vectors described herein may be employed in a 
TfAdpl-DNA complex. 

Vectors useful for therapeutic administration 
of a CARD-encoding nucleic acid can contain a 
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regulatory element that provides tissue specific or 
inducible expression of an operatively linked nucleic 
acid. One skilled in the art can readily determine an 
appropriate tissue- specific promoter or enhancer that 
5 allows expression of a CARD polypeptide or nucleic acid 
in a desired tissue. Any of a variety of inducible 
promoters or enhancers can also be included in the 
vector for regulatable expression of a CARD polypeptide 
or nucleic acid. Such inducible systems, include, for 
10 example, tetracycline inducible system (Gossen & 
Bizard, Proc . Natl. Acad. Sci. USA , 89:5547-5551 

(1992) ;, Gossen et al . , Science , 268:1766-1769 (1995); 
Clontech, Palo Alto, CA) ; metalothionein promoter 
induced by heavy metals; insect steroid hormone 

15 responsive to ecdysone or related steroids such as 
muristerone (No et al . , Proc. Natl. Acad. Sci. USA , 
93:3346-3351 (1996); Yao et al Nature, 366:476-479 

(1993) ; Invitrogen, Carlsbad, CA) ; mouse mammary tumor 
virus (MMTV) induced by steroids such as glucocortocoid 

20 and estrogen (Lee et al . , Nature , 294:228-232 (1981); 
and heat shock promoters inducible by temperature 
changes . 



An inducible system particularly useful for 
therapeutic administration utilizes an inducible 

25 promoter that can be regulated to deliver a level of 
therapeutic product in response to a given level of 
drug administered to an individual and to have little 
or no expression of the therapeutic product in the 
absence of the drug. One such system utilizes a Gal4 

3 0 fusion that is inducible by an antiprogestin such as 

mifepristone in a modified adenovirus vector (Burien et 
al., Proc. Natl. Acad. Sci. USA . 96:355-360 (1999). 
Another such inducible system utilizes the drug 
rapamycin to induce reconstitution of a transcriptional 

35 activator containing rapamycin binding domains of 
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FKBP12 and FRAP in an adeno-associated virus vector (Ye 
et al., Science . 283:88-91 (1999)). It is understood 
that any combination of an inducible system can be 
combined in any suitable vector, including those 
5 disclosed herein. Such a regulatable inducible system 
is advantageous because the level of expression of the 
therapeutic product can be controlled by the amount of 
drug administered to the individual or, if desired, 
expression of the therapeutic product can be terminated 
10 by stopping administration of the drug. 



The invention also provides a method for 
expression of a CARD- containing polypeptide by 
culturing cells containing a CARD-encoding nucleic acid 
under conditions suitable for expression of a CARD- 

15 containing polypeptide. Thus, there is provided a 
method for the recombinant production of a CARD- 
containing polypeptide of the invention by expressing 
the CARD-encoding nucleic acid sequences in suitable 
host cells. Recombinant DNA expression systems that 

20 are suitable to produce a CARD- containing polypeptide 
described herein are well-known in the art (see, for 
example, Ausubel et al . , supra , 2000). For example, 
the above-described nucleotide sequences can be 
incorporated into vectors for further manipulation. As 

2 5 used herein, vector refers to a recombinant DNA or RNA 
plasmid or virus containing discrete elements that are 
used to introduce heterologous DNA into cells for 
either expression or replication thereof. 



The invention additionally provides an 
30 isolated anti-CARD antibody having specific reactivity 
with a invention CARD-containing polypeptide. The 
anti-CARD antibody can be a monoclonal antibody or a 
polyclonal antibody. The invention further provides 
cell lines producing monoclonal antibodies having 
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specific reactivity with an invention CARD -containing 
protien . 



The invention thus provides antibodies that 
specifically bind a CARD- containing polypeptide. As 
5 used herein, the term "antibody" is used in its 

broadest sense to include polyclonal and monoclonal 
antibodies, as well as antigen binding fragments of 
such antibodies. With regard to an anti-CARD antibody 
of the invention, the term "antigen" means a native or 

10 synthesized CARD-containing polypeptide or fragment 
thereof. An anti-CARD antibody, or antigen binding 
fragment of such an antibody, is characterized by 
having specific binding activity for a CARD polypeptide 
or a peptide portion thereof of at least about 

15 1 x 10 5 M" 1 . Thus, Fab, F(ab') 2 , Fd and Fv fragments of 
an anti-CARD antibody, which retain specific binding 
activity for a CARD-containing polypeptide, are 
included within the definition of an antibody. 
Specific binding activity of a CARD-containing 

2 0 polypeptide can be readily determined by one skilled in 
the art, for example, by comparing the binding activity 
of an anti-CARD antibody to a CARD-containing 
polypeptide versus a reference polypeptide that is not 
a CARD-containing polypeptide. Methods of preparing 

25 polyclonal or monoclonal antibodies are well known to 
those skilled in the art (see, for example, Harlow and 
Lane, Antibodies: A Laboratory Manual , Cold Spring 
Harbor Laboratory Press (1988) ) . 



In addition, the term "antibody" as used 
3 0 herein includes naturally occurring antibodies as well 
as non-naturally occurring antibodies, including, for 
example, single chain antibodies, chimeric, 
bifunctional and humanized antibodies, as well as 
antigen-binding fragments thereof. Such non-naturally 
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occurring antibodies can be constructed using solid 
phase peptide synthesis, can be produced recombinantly 
or can be obtained, for example, by screening 
combinatorial libraries consisting of variable heavy 
5 chains and variable light chains as described by Huse 
et al., Science 246:1275-1281 (1989)). These and other 
methods of making, for example, chimeric, humanized, 
CDR-grafted, single chain, and bifunctional antibodies 
are well known to those skilled in the art (Winter and 

10 Harris, Immunol, Today 14:243-246 (1993); Ward et al . , 
Nature 341:544-546 (1989) ; Harlow and Lane, supra, 
1988); Hilyard et al . , Protein Engineering: A 
practical approach (IRL Press 1992) ; Borrabeck, 
Antibody Engineering , 2d ed. (Oxford University Press 

15 1995) ) . 

Anti-CARD antibodies can be raised using a 
CARD immunogen such as an isolated CARD- containing 
polypeptide having substantially the same amino acid 
sequence as SEQ ID NOS:12, 188, 97, 99, 101, 103, 86 

2 0 and 90, or a fragment thereof, which can be prepared 
from natural sources or produced recombinantly, or a 
peptide portion of the CARD- containing polypeptide. 
Such peptide portions of a CARD- containing polypeptide 
are functional antigenic fragments if the antigenic 

25 peptides can be used to generate a CARD-specif ic 
antibody. A non- immunogenic or weakly immunogenic 
CARD -containing polypeptide or portion thereof can be 
made immunogenic by coupling the hapten to a carrier 
molecule such as bovine serum albumin (BSA) or keyhole 

30 limpet hemocyanin (KLH) . Various other carrier 
molecules and methods for coupling a hapten to a 
carrier molecule are well known in the art (see, for 
example, Harlow and Lane, supra, 1988) . An immunogenic 
CARD- containing polypeptide fragment can also be 

35 generated by expressing the peptide as a fusion 
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protein, for example, to glutathione S transferase 
(GST) , polyHis or the like. Methods for expressing 
peptide fusions are well known to those skilled in the 
art (Ausubel et al . , supra, (2000)). 

5 The invention further provides a method for 

detecting the presence of a human CARD- containing 
polypeptide in a sample by contacting a sample with a 
CARD-specif ic antibody, and detecting the presence of 
specific binding of the antibody to the sample, thereby 

10 detecting the presence of a human CARD- containing 

polypeptide in the sample. CARD-specif ic antibodies 
can be used in diagnostic methods and systems to detect 
the level of CARD- containing polypeptide present in a 
sample. As used herein, the term "sample" is intended 

15 to mean any biological fluid, cell, tissue, organ or 
portion thereof, that includes or potentially includes 
CARD nucleic acids or polypeptides. The term includes 
samples present in an individual as well as samples 
obtained or derived from the individual. For example, 

20 a sample can be a histologic section of a specimen 
obtained by biopsy, or cells that are placed in or 
adapted to tissue culture. A sample further can be a 
subcellular fraction or extract, or a crude or 
substantially pure nucleic acid or polypeptide 

2 5 preparation. 

CARD-specif ic antibodies can also be used for 
the immunoaf f inity or affinity chromatography 
purification of an invention CARD- containing 
polypeptide. In addition, methods are contemplated 

3 0 herein for detecting the presence of an invention CARD- 

containing polypeptide in a cell, comprising contacting 
the cell with an antibody that specifically binds to 
CARD -containing polypeptides under conditions 
permitting binding of the antibody to the CARD- 



WO 01/90156 PCT/US01/17158 

59 

containing polypeptides, detecting the presence of the 
antibody bound to the CARD- containing polypeptide, and 
thereby detecting the presence of invention 
polypeptides in a cell. With respect to the detection 
5 of such polypeptides, the antibodies can be used for in 
vitro diagnostic or in vivo imaging methods. 

Immunological procedures useful for in vitro 
detection of target CARD- containing polypeptides in a 
sample include immunoassays that employ a detectable 

10 antibody. Such immunoassays include, for example, 

immunohistochemistry , immunofluorescence, ELISA assays, 
radioimmunoassay, FACS analysis, immunoprecipitation, 
immunoblot analysis, Pandex microf luorimetric assay, 
agglutination assays, flow cytometry and serum 

15 diagnostic assays, which are well known in the art 

(Harlow and Lane, supra, 1988; Harlow and Lane, Using 
Antibodies: A Laboratory Manual , Cold Spring Harbor 
Press (1999) ) . 



An antibody can be made detectable by various 
20 means well known in the art. For example, a detectable 
marker can be directly attached to the antibody or 
indirectly attached using, for example, a secondary 
agent that recognizes the CARD specific antibody. 
Useful markers include, for example, radionucleotides, 
25 enzymes, binding proteins such as biotin, fluorogens, 
chromogens and chemi luminescent labels. 



An antibody can also be detectable by, for 
example, a fluorescent labeling agent that chemically 
binds to antibodies or antigens without denaturation to 
3 0 form a fluorochrome (dye) that is a useful 
immunof luorescent tracer. A description of 
immunof luorescent analytic techniques is found in 
DeLuca, "Immunofluorescence Analysis", in Antibody As a 
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eds . , John Wiley & Sons, 
which is incorporated herein 



In one embodiment, the indicating group is an 
5 enzyme, such as horseradish peroxidase (HRP) , glucose 
oxidase, and the like. In another embodiment, 
radioactive elements are employed labeling agents. The 
linking of a label to a substrate, i.e., labeling of 
nucleic acid probes, antibodies, polypeptides, and 

10 proteins, is well known in the art. For instance, an 
invention antibody can be labeled by metabolic 
incorporation of radiolabeled amino acids provided in 
the culture medium. See, for example, Galfre et al . , 
Meth. Enzymol . , 73:3-46 (1981). Conventional means of 

15 protein conjugation or coupling by activated functional 
groups are particularly applicable. See, for example, 
Aurameas et al . , Scand . J . Immunol . , Vol . 8 , 
Suppl. 7:7-23 (1978), Rodwell et al . , Biotech. , 
3:889-894 (1984), and U.S. Patent No. 4,493,795. 



2 0 In addition to detecting the presence of a 

CARD- containing polypeptide, invention ant i -CARD 
antibodies are contemplated for use herein to alter the 
activity of the CARD- containing polypeptide in living 
animals, in humans, or in biological tissues or fluids 

25 isolated therefrom. The term "alter" refers to the 
ability of a compound such as a CARD -containing 
polypeptide, a CARD-encoding nucleic acid, an agent or 
other compound to increase or decrease biological 
activity which is modulated by the compound, by 

30 functioning as an agonist or antagonist of the 

compound. Accordingly, compositions comprising a 
carrier and an amount of an antibody having specificity 
for CARD- containing polypeptides effective to block 
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naturally occurring ligands or other CARD-associated 
polypeptides from binding to invention CARD- containing 
polypeptides are contemplated herein. For example, a 
monoclonal antibody directed to an epitope of an 
5 invention CARD- containing polypeptide, including an 
amino acid sequence substantially the same as SEQ ID 
12, 188, 97, 99, 101, 103, 86 and 90, can be useful for 
this purpose. 



The present invention further provides 
10 transgenic non-human mammals that are capable of 
expressing exogenous nucleic acids encoding CARD- 
containing polypeptides. As employed herein, the 
phrase "exogenous nucleic acid" refers to nucleic acid 
sequence which is not native to the host, or which is 
15 present in the host in other than its native 

environment, for example, as part of a genetically 
engineered DNA construct. In addition to naturally 
occurring CARD- containing polypeptide levels, a CARD- 
containing polypeptide of the invention can either be 
2 0 overexpressed or underexpressed in transgenic mammals, 
for example, underexpressed in a knock-out animal. 

Also provided are transgenic non-human 
mammals capable of expressing CARD-encoding nucleic 
acids so mutated as to be incapable of normal activity. 

25 Therefore, the transgenic non-human mammals do not 
express native CARD -containing polypeptide or have 
reduced expression of native CARD- containing 
polypeptide. The present invention also provides 
transgenic non-human mammals having a genome- comprising 

30 antisense nucleic acids complementary to CARD-encoding 
nucleic acids, placed so as to be transcribed into 
antisense mRNA complement airy to CARD-encoding mRNA, 
which hybridizes to the mRNA and, thereby, reduces the 
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translation thereof. The nucleic acid can additionally 
comprise an inducible promoter and/or tissue specific 
regulatory elements, so that expression can be induced, 
or restricted to specific cell types. 



5 Animal model systems useful for elucidating 

the physiological and behavioral roles of CARD- 
containing polypeptides are also provided, and are 
produced by creating transgenic animals in which the 
expression of the CARD- containing polypeptide is 

10 altered using a variety of techniques. Examples of 
such techniques include the insertion of normal or 
mutant versions of nucleic acids encoding a CARD- 
containing polypeptide by microinjection, retroviral 
infection or other means well known to those skilled in 

15 the art, into appropriate fertilized embryos to produce 
a transgenic animal, see, for example, Hogan et al . , 
Manipulating the Mouse Embryo: A Laboratory Manual 
(Cold Spring Harbor Laboratory, (1986)). Transgenic 
animal model systems are useful for in vivo screening 

20 of compounds for identification of specific ligands, 
such as agonists or antagonists, which activate or 
inhibit a biological activity. 



Also contemplated herein, is the use of 
homologous recombination of mutant or normal versions 

2 5 of CARD-encoding genes with the native gene locus in 

transgenic animals, to alter the regulation of 
expression or the structure of CARD - containing 
polypeptides by replacing the endogeneous gene with a 
recombinant or mutated CARD-encoding gene. Methods for 

3 0 producing a transgenic non-human mammal including a 

gene knock-out non-human mammal, are well known to 
those skilled in the art (see, Capecchi et al. , Science 
244:1288 (1989); Zimmer et al . , Nature 338:150 (1989); 
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Shastry, Exoerentia . 51:1028-1039 (1995); Shastry, Mol , 
Cell , Biochem. , 181:163-179 (1998); and U.S. Patent Mo. 
5,616,491, issued April 1, 1997, No. 5,750,826, issued 
May 12, 1998, and No. 5,981,830, issued November 9, 
5 1999) . 



In addition to homologous recombination, 
additional methods such as microinjection can be used 
which add genes to the host genome without removing 
host genes. Microinjection can produce a transgenic 

10 animal that is capable of expressing both endogenous, 
and exogenous CARD -containing polypeptides. Inducible 
promoters can be linked to the coding region of nucleic 
acids to provide a means to regulate expression of the 
transgene. Tissue specific regulatory elements can be 

15 linked to the coding region to permit tissue-specific 
expression of the transgene. Transgenic animal model 
systems are useful for in vivo screening of compounds 
for identification of specific ligands, i.e., agonists 
and antagonists, which activate or inhibit CARD- 

20 containing polypeptide responses. 

In accordance with another embodiment of the 
invention, a method is provided for identifying a CARD- 
associated polypeptide (CAP) . The method is carried 
out by contacting an invention CARD -containing 
25 polypeptide with a candidate CAP and detecting 

association of the CARD- containing polypeptide with the 
CAP. 



As used herein, the term "CARD-associated 
polypeptide" or "CAP" means a polypeptide that can 
30 specifically bind to the CARD- containing polypeptides 
of the invention, or to any functional fragment of a 
CARD- containing polypeptide of the invention. Because 
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CARD -containing polypeptides of the invention contain 
domains which can self -associate , CARD-containing 
polypeptides are encompassed by the term CAP. An 
exemplary CAP is a protein or a polypeptide portion of 
5 a protein that can bind an NB-ARC (NACHT) , CARD, LRR or 
ANGIO-R domain of an invention CARD- containing 
polypeptide. A CAP can be identified, for example, 
using in vitro protein binding assays similar to those 
described in, for example, Ausubel et al . , supra , 2000, 
10 and by in vivo methods including methods such as yeast 
two-hybrid assays, or other protein-interaction assays 
and methods known in the art . 

Normal association of CARD- containing 
polypeptide and a CAP polypeptide in a cell can be 
altered due, for example, to the expression in the cell 
of a variant CAP or CARD-containing polypeptide, 
respectively, either of which can compete with the 
normal binding function of a CARD-containing 
polypeptide and, therefore, can decrease the 
association of CAP and CARD-containing polypeptides in 
a cell. The term "variant" is used generally herein to 
mean a polypeptide that is different from the CAP or 
CARD-containing polypeptide that normally is found in a 
particular cell type. Thus, a variant can include a 
mutated protein or a naturally occurring protein, such 
as an isoform, that is not normally found in a 
particular cell type. 

CARD-containing polypeptides and CARD- 
associated polypeptides of the invention can be 
3 0 characterized, for example, using in vitro binding 
assays or the yeast two hybrid system. An in vivo 
transcription activation assay such as the yeast two 
hybrid system is particularly useful for identifying 



15 



20 



25 



WO 01/90156 PCT/US01/17158 

65 

and manipulating the association of proteins. In 
addition, the results observed in such an assay likely 
mirror the events that naturally occur in a cell. 
Thus, the results obtained in such an in vivo assay can 
5 be predictive of results that can occur in a cell in a 
subject such as a human subject. 



A transcription activation assay such as the 
yeast two hybrid system is based on the modular nature 
of transcription factors, which consist of functionally 

10 separable DNA-binding and trans -activation domains. 

When expressed as separate proteins, these two domains 
fail to mediate gene transcription. However, 
transcription activation activity can be restored if 
the DNA-binding domain and the trans- activation domain 

15 are bridged together due, for example, to the 

association of two proteins. The DNA-binding domain 
and trans-activation domain can be bridged, for 
example, by expressing the DNA-binding domain and 
trans-activation domain as fusion proteins (hybrids) , 

2 0 provided that the proteins that are fused to the 
domains can associate with each other. The 
non-covalent bridging of the two hybrids brings the 
DNA-binding and trans -activation domains together and 
creates a transcriptionally competent complex. The 

2 5 association of the proteins is determined by observing 
transcriptional activation of a reporter gene. 



The yeast two hybrid systems exemplified 
herein use various, strains of S. cerevisiae as host 
cells for vectors that express the hybrid proteins. A 
3 0 transcription activation assay also can be performed 
using, for example, mammalian cells. However, the 
yeast two hybrid system is particularly useful due to 
the ease of working with yeast and the speed with which 
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the assay can be performed. For example, yeast host 
cells containing a lacZ reporter gene linked to a LexA 
operator sequence can be used to demonstrate that a 
CARD domain of an invention CARD-containing polypeptide 
5 can interact with itself or other CARD-containing 

polypeptides. For example, the DNA-binding domain can 
consist of the LexA DNA-binding domain, which binds the 
LexA promoter, fused to the CARD domain of a CARD- 
containing polypeptide of the invention and the 

10 trans-activation domain can consist of the B42 acidic 
region separately fused to several cDNA sequences which 
encode known CARD-containing polypeptides. When the 
LexA domain is non-covalently bridged to a 
trans-activation domain fused to a CARD-containing 

15 polypeptide, the association can activate transcription 
of the reporter gene. 



A CAP, for example, a CARD-containing 
polypeptide, an NB-ARC- containing polypeptide or a LRR- 
containing polypeptide, also can be identified using 

20 well known in vitro assays, for example, an assay 
utilizing a glutathione-S-transf erase (GST) fusion 
protein. Such an in vitro assay provides a simple, 
rapid and inexpensive method for identifying and 
isolating a CAP. Such an in vitro assay is 

25 particularly useful in confirming results obtained in 
vivo and can be used to characterize specific binding 
domains of a CAP. For example, a GST can be fused to a 
CARD-containing polypeptide of the invention, and 
expressed and purified by binding to an affinity matrix 

3 0 containing immobilized glutathione. If desired, a 

sample that can contain a CAP or active fragments of a 
CAP can be passed over an affinity column containing 
bound GST/ CARD and a CAP that binds to a CARD- 
containing polypeptide can be obtained. In addition, 
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GST/ CARD can be used to screen a cDNA expression 
library, wherein binding of the GST/ CARD fusion protein 
to a clone indicates that the clone contains a cDNA 
encoding a CAP. 

5 Thus, one of skill in the art will recognize 

that using the CARD- containing polypeptides described 
herein, a variety of methods, such as protein 
purification, protein interaction cloning, or protein 
mass -spectrometry, can be used to identify a CAP. 

10 Although the term "CAP" is used generally, it 

should be recognized that a CAP that is identified 
using the novel polypeptides described herein can be a 
fragment of a protein. Thus, as used herein, a CAP 
also includes a polypeptide that specifically 

15 associates to a portion of an invention CARD- containing 
polypeptide that does not include a CARD domain. For 
example, a CAP can associate with the NB-ARC domain of 
CLAN or CARD3X. As used herein, a "candidate CAP" 
refers to a polypeptide containing a polypeptide 

2 0 sequence know or suspected of binding one or more CARD- 

containing polypeptides of the invention. Thus, a CAP 
can represent a full-length protein or a CARD- 
associating fragment thereof. Since a CAP polypeptide 
can be a full-length protein or a CARD-associating 
25 fragment thereof, one of skill in the art will 

recognize that a CAP-encoding nucleic acid, such as the 
genomic sequence, an mRNA sequence or a cDNA sequence 
need not encode the full-length protein. Thus, a cDNA 
can encode a polypeptide that is a fragment of a full- 

3 0 length CAP which, nevertheless, binds one or more 

invention CARD- containing polypeptides. It is also 
within the scope of the invention that a full-length 
CAP can assume a conformation that does not, absent 
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some post-translational modification, bind a CARD- 
containing polypeptide of the invention, due, for 
example, to steric blocking of the binding site. Thus, 
a CAP can be a protein or a polypeptide portion of a 
5 protein that can bind one of the CARD-containing 
polypeptides of the invention. Also, it should be 
recognized that a CAP can be identified by using a 
minimal polypeptide derived from the sequences of the 
CARD-containing polypeptides of the invention, and does 
10 not necessarily require that the full-length molecules 
be employed for identifying such CAPs . 



Since CARD-containing polypeptides can be 
involved in apoptosis, the association of a CAP with a 
CARD-containing polypeptide can affect the sensitivity 
15 or resistance of a cell to apoptosis or can induce or 
block apoptosis induced by external or internal 
stimuli. The identification of various CAPs by use of 
known methods can be used to determine the function of 
these CAPs in cell death or signal transduction 

2 0 pathways controlled by CARD-containing polypeptides, 

allowing for the development of assays that are useful 
for identifying agents that effectively alter the 
association of a CAP with a CARD-containing 
polypeptide. Such agents can be useful for providing 
25 effective therapy for conditions caused, at least in 
part, by insufficient apoptosis, such as a cancer, 
autoimmune disease or certain viral infections. Such 
agents can also be useful for ' providing an effective 
therapy for diseases where excessive apoptosis is known 

3 0 to occur, such as stroke, heart failure, or AIDS. 



Assays of the invention can be used for 
identification of agents that alter the self- 
association of the CARD-containing polypeptides of the 
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invention. Thus, the methods of the invention can be 
used to identify agents that alter the self -association 
of CARD2X, CARD3X, CLAN A, CLAN B, CLAN C, CLAN D, COP- 
1 and COP-2 (set forth in SEQ ID NOS : 12, 188, 97, 99, 
5 101, 103, 86 and 90) via their CARD domains, NB-ARC 
domains, LRR domains, or other, domains within these 
polypeptides . 



The ATP-binding and hydrolysis of the NB-ARC 
domains can be critical for function of a NAC 

10 polypeptide, for example, by altering the 

oligomerization of the NAC. Thus, agents that 
interfere with or enhance ATP or nucleotide binding 
and/or hydrolysis by the NB-ARC domain of a NAC 
polypeptide of the invention, such as CLAN (SEQ ID 

15 NOS: 97, 99, 101 or 103) can also be useful for altering 
the activity of these polypeptides in cells. 

A further embodiment of the invention 
provides a method to identify agents that can 
effectively alter CARD- containing polypeptide activity, 

2 0 for example the ability of CARD- containing polypeptides 

to associate with one or more heterologous proteins . 
Thus, the present invention provides a screening assay 
useful for identifying an effective agent, which can 
alter the association of a CARD-containing polypeptide 
25 with a CARD-associated polypeptide (CAP) , such as a 

heterologous CARD-containing polypeptide. Since CARD- 
containing polypeptides are involved in biochemical 
processes such as apoptosis, the identification of such 
effective agents can be useful for altering the level 

3 0 of a biochemical process such as apoptosis in a cell, 

for example in a cell of a subject having a pathology 
characterized by an increased or decreased level of 
apoptosis . 
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Further, effective agents can be useful for 
alteration of other biochemical process modulated by a 
CARD-containing polypeptide of the invention. 
Additional biochemical processes modulated by CARD- 
5 containing polypeptide include, for example, NF-kB 
induction, cytokine processing, cytokine receptor 
signaling, cJUN N-terminal kinase induction, and 
caspase -mediated proteolysis activation/inhibition, 
transcription, inflammation and cell adhesion. 

10 As used herein, the term "agent" means a 

chemical or biological molecule such as a simple or 
complex organic molecule, a peptide, a peptido-mimetic , 
a polypeptide, a protein or an oligonucleotide that has 
the potential for altering the association of a CARD- 

15 containing polypeptide with a heterologous protein or 
altering the ability of a CARD-containing polypeptide 
to self -associate or altering the ligand binding or 
catalytic activity of a CARD-containing polypeptide. 
An exemplary ligand binding activity is nucleotide 

20 binding activity, such as ADP or ATP binding activity; 
and exemplary catalytic activities are nucleotide 
hydrolytic activity and proteolytic activity. In 
addition, the term "effective agent" is used herein to 
mean an agent that is confirmed as capable of altering 

25 the association of a CARD-containing polypeptide with a 
heterologous protein or altering the ability of a CARD- 
containing polypeptide to self -associate or altering 
the ligand binding or catalytic activity of a CARD- 
containing polypeptide. For example, an effective 

30 agent may be an anti-CARD antibody, a CARD-associated 
polypeptide, a caspase inhibitor, and the like. 

As used herein, the term "alter the 
association" means that the association between two 
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specifically interacting polypeptides either is 
increased or decreased due to the presence of an 
effective agent. As a result of an altered association 
of CARD -containing polypeptide with another polypeptide 
5 in a cell, the activity of the CARD- containing 

polypeptide or the CAP can be increased or decreased, 
thereby altering a biochemical, process, for example, 
the level of apoptosis in the cell. As used herein, 
the term "alter the activity" means that the agent can 

10 increase or decrease the activity of a CARD- containing 
polypeptide in a cell, thereby modulating a biochemical 
process in a cell, for example, the level of apoptosis 
in the cell. Similarly, the term "alter the level" of 
a biological process modulated by a CARD -containing 

15 polypeptide refers to an increase or decrease a 

biochemical process which occurs upon altering the 
activity of a CARD- containing polypeptide. For 
example, an effective agent can increase or decrease 
the CARD: CARD- associating activity of a CARD- containing 

20 polypeptide, which can result in decreased apoptosis. 
In another example, alteration of the ATP hydrolysis 
activity can modulate the ability of the NB-ARC domain 
of a CARD-containing polypeptide to associate with 
other NB-ARC -containing polypeptides, such as Apaf-1, 

25 thereby altering any process effected by such 

association between a CARD-containing polypeptide and 
an NB-ARC-containing polypeptide. 

An effective agent can act by interfering 
with the ability of a CARD-containing polypeptide to 
3 0 associate with another polypeptide, or can act by 
causing the dissociation of a CARD-containing 
polypeptide from a complex with a CARD-associated 
polypeptide, wherein the ratio of bound CARD-containing 
polypeptide to free CARD-containing polypeptide is 
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related to the level of a biochemical process, such as, 
apoptosis, in a cell. For example, binding of a ligand 
to a CAP can allow the CAP, in turn, to bind a specific 
CARD- containing polypeptide such that all of the 
5 specific CARD-containing polypeptide is bound to a CAP, 
and can result in decreased apoptosis. The 
association, for example, of a CARD-containing 
polypeptide and a CARD-containing polypeptide can 
result in activation or inhibition of the NB-ARC : NB- 

10 ARC-associating activity of a CARD-containing 

polypeptide. In the presence of an effective agent, 
the association of a CARD-containing polypeptide and a 
CAP can be altered, which can, for example, alter the 
activation of caspases in the cell. As a result of the 

15 altered caspase activation, the level of apoptosis in a 
cell can be increased or decreased. Thus, the 
identification of an effective agent that alters the 
association of a CARD-containing polypeptide with 
another polypeptide can allow for the use of the 

20 effective agent to increase or decrease the level of a 
biological process such as apoptosis. 



An effective agent can be useful, for 
example, to increase the level of apoptosis in a cell 
such as a cancer cell, which is characterized by having 

2 5 a decreased level of apoptosis as compared to its 

normal cell counterpart. An effective agent also can 
be useful, for example, to decrease the level of 
apoptosis in a cell such as a T lymphocyte in a subject 
having a viral disease such as acquired 

3 0 immunodeficiency syndrome, which is characterized by an 

increased level of apoptosis in an infected T cell as 
compared to a normal T cell. Thus, an effective agent 
can be useful as a medicament for altering the level of 
apoptosis in a subject having a pathology characterized 
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by increased or decreased apoptosis. In addition, an 
effective agent can be used, for example, to decrease 
the level of apoptosis and, therefore, increase the 
survival time of a cell such as a hybridoma cell in 
5 culture. The use of an effective agent to prolong the 
survival of a cell in -vitro can significantly improve 
bioproduction yields in industrial tissue culture 
applications . 



A CARD -containing polypeptide that lacks the 
10 ability to bind the NB-ARC domain or LRR domain of 
another polypeptide but retains the ability to 
self -associate via its CARD domain or to bind to other 
CARD- containing polypeptides is an example of an 
effective agent, since the expression of a non-NB-ARC- 
15 associating or non-catalytically active CARD- containing 
polypeptide in a cell can alter the association of a 
the endogenous CARD -containing polypeptide with itself 
or with CAPs. 



Thus, it should be recognized that a mutation 
20 of a CARD -containing polypeptide can be an effective 
agent, depending, for example, on the normal levels of 
. CARD -containing polypeptide and CARD-associated 
polypeptide that occur in a particular cell type. In 
addition, an active fragment of a CARD- containing 
25 polypeptide can be an effective agent, provided the 
active fragment can alter the association of a CARD- 
containing polypeptide and another polypeptide in a 
cell. Such active fragments, which can be peptides as 
small as about five amino acids, can be identified, for 
30 example, by screening a peptide library (see, for 

example, Ladner et al . , U.S. Patent No: 5,223,409) to 
identify peptides that can bind a CARD-associated 
polypeptide. 
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Similarly, a fragment of a CARD-associated 
polypeptide also can be an effective agent. A fragment 
of CARD-associated polypeptide can be useful, for 
example, for decreasing the association of a CARD- 
5 containing polypeptide with a CAP in a cell by 
competing for binding to the CARD-containing 
polypeptide. A non-naturally occurring peptido-mimetic 
also can be useful as an effective agent. Such a 
peptido-mimetic can include, for example, a peptoid, 
10 which is peptide-like sequence containing N-substi tuted 
glycines, or an oligocarbamate . A peptido-mimetic can 
be particularly useful as an effective agent due, for 
example, to having an increased stability to enzymatic 
degradation in vivo. 



15 In accordance with another embodiment of the 

present invention, there is provided a method of 
identifying an effective agent that alters the 
association of an invention CARD-containing polypeptide 
with a CARD-associated polypeptide (CAP) , by the steps 

20 of: 

(a) contacting a CARD-containing 
polypeptide and a CAP polypeptide, under 
conditions that allow the CARD-containing 
polypeptide and CAP polypeptide to associate, 

25 with an agent suspected of being able to 

alter the association of the CARD-containing 
polypeptide and CAP polypeptides; and 

(b) detecting the altered association 
of the CARD-containing polypeptide and CAP 

30 polypeptide, where the altered association 

identifies an effective agent. 
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Methods well-known in the art for detecting 
the altered association of the CARD- containing 
polypeptide and CAP polypeptides, for example, 
measuring protein: protein binding, protein degradation 
5 or apoptotic activity can be employed in bioassays 
described herein to identify agents as agonists or 
antagonists of CARD -containing polypeptides. As 
described herein, CARD -containing polypeptides have the 
ability to self -associate . Thus, methods for 
10 identifying effective agents that alter the association 
of a CARD -containing polypeptide with a CAP are useful 
for identifying effective agents that alter the ability 
of a CARD -containing .polypeptide to self -associate . 



As used herein, "conditions that allow said 
15 CARD- containing polypeptide and CAP polypeptide to 

associate" refers to environmental conditions in which 
a CARD- containing polypeptide and CAP specifically 
associate. Such conditions will typically be aqueous 
conditions, with a pH between 3.0 and 11.0, and 
20 temperature below 100°C. Preferably, the conditions 
will be aqueous conditions with salt concentrations 
below the equivalent of 1 M NaCl , and pH between 5.0 
and 9.0, and temperatures between 0°C and 50°C. Most 
preferably, the conditions will range from 
25 physiological conditions, of normal yeast or mammalian 
cells, or conditions favorable for carrying out in 
vitro assays such as immunoprecipitation and GST 
protein: protein association assays, and the like. 
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In another embodiment of the invention, a 
method is provided for identifying agents that modulate 
a ligand binding or catalytic activity of an invention 
CARD- containing polypeptide. The method contains the 
5 steps of contacting an invention CARD- containing 
polypeptide with an agent suspected of modulating a 
ligand binding or catalytic activity of the CARD- 
containing polypeptide and measuring a ligand binding 
or catalytic activity of the CARD -containing 
10 polypeptide, where modulated ligand binding or 

catalytic activity identifies the agent as an agent 
that alters the ligand binding or catalytic activity of 
a CARD- containing polypeptide. 

As used herein in regard to ligand binding or 
15 catalytic activity, "modulate" refers to an increase or 
decrease in ligand binding or catalytic activity. 
Thus, modulation encompasses inhibition of ligand 
binding or catalytic activity as well as activation or 
enhancement of ligand binding or catalytic activity. 
2 0 Exemplary ligand binding activities include nucleotide 
binding activity. Exemplary catalytic binding 
activities include nucleotide hydrolysis and 
proteolysis activities. 

Methods for measuring ligand binding or 
25 catalytic activities are well known in the art, as 
disclosed herein. For example, an agent known or 
suspected of modulating ligand binding or catalytic 
activity can be contacted with an invention CARD- 
containing polypeptide in vivo or in vitro, and the 
30 ligand binding or catalytic activity can be measured 
using known methods. For example, enzymatic activity 
can be measured using a cleavable reporter, where the 
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cleavable reporter generates or alters a measurable 
signal such as absorption, fluorescence or radioactive 
decay. Exemplary agents that can modulate ligand 
binding or catalytic activity include peptides, 
5 peptidomimetics and other peptide analogs, non-peptide 
organic molecules such as naturally occuring protease 
inhibitors and derviatives thereof, nucleotides and 
nucleotide analogs, and the like. Such inhibitors can 
be either reversible or irreversible, as is well known 
10 in the art. 



Agents that modulate the ligand binding or 
catalytic activity of a CARD- containing polypeptide 
identified using the invention methods can be used to 
modulate the activity of a CARD- containing polypeptide. 

15 For example, and agent can modulate the nucleotide 

binding or nucleotide hydrolytic activity of an NB-ARC 
domain of a CARD- containing polypeptide. In another 
example, an agent can modulate the catalytic activity 
of a protease domain such as a caspase domain. Methods 

2 0 of modulating the ligand binding or catalytic 

activities of invention CARD- containing proteins can be 
used in methods of altering biochemical processes 
modulated by CARD- containing proteins, such as the 
biochemical processes disclosed herein. 



25 In yet another embodiment of the present 

invention, there are provided methods for altering 
ligand binding or catalytic activity of a CARD- 
containing polypeptide of the invention, the method 
comprising: 

30 contacting an CARD- containing polypeptide 

with an effective amount of an agent identified by the 
herein- described bioassays. 
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The present invention also provides in vitro 
screening assays. Such screening assays are 
particularly useful in that they can be automated, 
which allows for high through-put screening, for 
5 example, of randomly or rationally designed agents such 
as drugs, pept idomimetics or peptides in order to 
identify those agents that effectively alter the 
association of a CARD- containing polypeptide and a CAP 
or the catalytic or ligand binding activity of a CARD- 

10 containing polypeptide and, thereby, alter a 

biochemical process modulated by a CARD- containing 
polypeptide such as apoptosis. An in vitro screening 
assay can utilize, for example, a CARD-containing 
polypeptide including a CARD-containing fusion protein 

15 such as a CARD-glutathione-S-transf erase fusion 

protein. For use in the in vitro screening assay, the 
CARD-containing polypeptide should have an affinity for 
a solid substrate as well as the ability to associate 
with a CARD-associated polypeptide. For example, when 

20 a CARD-containing polypeptide is used in the assay, the 
solid substrate can contain a covalently attached 
anti-CARD antibody. Alternatively, a GST/ CARD fusion 
protein can be used in the assay and the solid 
substrate can contain covalently attached glutathione, 

25 which is bound by the GST component -of the GST/ CARD 
fusion protein. Similarly, a CARD-associated 
polypeptide, or GST/NB- ARC- containing polypeptide 
fusion protein can be used in any of a variety of in 
vitro enzymatic or in vitro binding assays known in the 

3 0 art and described in texts such as Ausubel et al . , 
supra , 2000. 
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An in vitro screening assay can be performed 
by allowing a CARD-containing polypeptide, for example, 
to bind to the solid support, then adding a CARD- 
associated polypeptide and an agent to be tested. 
5 Reference reactions, which do not contain an agent, can 
be performed in parallel. Following incubation under 
suitable conditions, which include, for example, an 
appropriate buffer concentration and pH and time and 
temperature that permit binding of the particular CARD- 

10 containing polypeptide and CARD-associated polypeptide, 
the amount of protein that has associated in the 
absence of an agent and in the presence of an agent can 
be determined. The association of a CARD- associated 
polypeptide with a CARD-containing polypeptide can be 

15 detected, for example, by attaching a detectable moiety 
such as a radionuclide or a fluorescent label to a 
CARD- associated polypeptide and measuring the amount of 
label that is associated with the solid support, 
wherein the amount of label detected indicates the 

2 0 amount of association of the CARD-associated 

polypeptide with a CARD-containing polypeptide. An 
effective agent is determined by comparing the amount 
of specific binding in the presence of an agent as 
compared to a reference level of binding, wherein an 
25 effective agent alters the association of CARD- 
containing polypeptide with the CARD-associated 
polypeptide. Such an assay is particularly useful for 
screening a panel of agents such as a peptide library 
in order to detect an effective agent. 

3 0 Various binding assays to identify cellular 

proteins that interact with protein binding domains are 
known in the art and include, for example, yeast 
two-hybrid screening assays (see, for example, U.S. 
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Patent Nos . 5,283,173, 5,468,614 and 5,667,973; Ausubel 
et al . , supra , 2 000; Luban et al . , Curr . Op in. 
Biotechnol . 6:59-64 (1995)) and affinity column 
chromatography methods using cellular extracts. By 
5 synthesizing or expressing polypeptide fragments 
containing various CARD-associat ing sequences or 
deletions, the CARD binding interface can be readily 
identified . 



Another assay for screening of agents that 
10 alter the activity of a CARD- containing polypeptide is 
based on altering the phenotype of yeast by expressing 
a CARD- containing polypeptide. In one embodiment, 
expression of a CARD-containing polypeptide can be 
inducible (Tao et al . , J. Biol. Chem. 273:23704-23708 
15 (1998) , and the compounds can be screened when CARD- 
containing polypeptide expression is induced. CARD- 
containing polypeptides of the invention can also be 
co- expressed in yeast with CAP polypeptides used to 
screen for compounds that antagonize the activity of 
20 the CARD-containing polypeptide. 

Also provided with the present invention are 
assays to identify agents that alter CARD-containing 
polypeptide expression. Methods to determine CARD- 
containing polypeptide expression can involve detecting 

25 a change in CARD-containing polypeptide abundance in 
response to contacting the cell with an agent that 
modulates CARD-containing polypeptide expression. 
Assays for detecting changes in polypeptide expression 
include, for example, immunoassays with CARD- specific 

3 0 antibodies, such as immunoblotting, immunofluorescence, 
immunohistochemistry and immunoprecipitation assays, as 
described herein. 
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As understood by those of skill in the art, 
assay methods for identifying agents that alter CARD- 
containing polypeptide activity generally require 
comparison to a reference. One type of a "reference" 
5 is a cell or culture that is treated substantially the 
same as the test cell or test culture exposed to the 
agent, with the distinction that the "reference" cell 
or culture is not exposed to the agent. Another type 
of "reference" cell or culture can be a cell or culture 

10 that is identical to the test cells, with the exception 
that the "reference" cells or culture do not express a 
CARD -containing polypeptide. Accordingly, the response 
of the transf ected cell to an agent is compared to the 
response, or lack thereof, of the "reference" cell or 

15 culture to the same agent under the same reaction 
conditions. 



Methods for producing pluralities of agents 
to use in screening for compounds that alter the 
activity of a CARD- containing polypeptide, including 
2 0 chemical or biological molecules such as simple or 

complex organic molecules, metal -containing compounds, 
carbohydrates, peptides, proteins, peptidomimetics, 
glycoproteins, lipoproteins, nucleic acids, antibodies, 
and the like, are well known in the art and are 

2 5 described, for example, in Huse, U.S. Patent No. 

5,264,563; Francis et al . , Curr. Qpin. Chem. Biol. 
2:422-428 (1998); Tietze et al . , Curr. Biol. . 2:363-371 
(1998); Sofia, MoT. Divers. 3:75-94 (1998>; Eichler et 
al., Med. Res. Rev. 15:481-496 (1995); and the like. 

3 0 Libraries containing large numbers of natural and 

synthetic agents also can be obtained from commercial 
sources. Combinatorial libraries of molecules can be 
prepared using well known combinatorial chemistry 
methods (Gordon et al . , J. Med. Chem. 37: 1233-1251 

i 
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(1994); Gordon et al . , J . Med. Chem. 37: 1385-1401 
(1994); Gordon et al . , Acc . Chem. Res. 29:144-154 
(1996); Wilson and Czarnik, eds . , Combl na t or i a 1 
Chemistry: Synthesis and Application , John Wiley Sc 
5 Sons, New York (1997)) . 

The invention further provides a method of 
diagnosing or predicting clinical prognosis of a 
pathology characterized by an increased or decreased 
level of a CARD -containing polypeptide in a subject. 

10 The method includes the steps of (a) obtaining a test 

sample from the subject; (b) contacting the sample with 
an agent that can bind a CARD- containing polypeptide of 
the invention under suitable conditions, wherein the 
conditions allow specific binding of the agent to the 

15 CARD- containing polypeptide; and (c) comparing the 

amount of the specific binding in the test sample with 
the amount of specific binding in a reference sample, 
wherein an increased or decreased amount of the 
specific binding in the test sample as compared to the 

2 0 reference sample is diagnostic of, or predictive of the 

clinical prognosis of, a pathology. The agent can be, 
for example, an anti-CARD antibody, a CARD-associated- 
polypeptide (CAP) , or a CARD-encoding nucleic acid. 

Exemplary pathologies for diagnosis or the 
25 prediction of clinical prognosis include any of the 
pathologies described herein, such as neoplastic 
pathologies (e.g. cancer), autoimmune diseases, and 
other pathologies related to abnormal cell 
proliferation or abnormal cell death (e.g. apoptosis) , 

3 0 as disclosed herein. 

The invention also provides a method of 
diagnosing cancer or monitoring cancer therapy by 
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contacting a test sample from a patient with a CARD- 
specific antibody. The invention additionally provides 
a method of assessing prognosis (e.g., predicting the 
clinical prognosis) of patients with cancer comprising 
5 contacting a test sample from a patient with a CARD- 
specific antibody. 

The invention additionally provides a method 
of diagnosing cancer or monitoring cancer therapy by 
contacting a test sample from a patient with a 

10 oligonucleotide that selectively hybridizes to a CARD- 
encoding nucleic acid molecule. The invention further 
provides a method of assessing prognosis (e.g., 
predicting the clinical prognosis) of patients with 
cancer by contacting a test sample from a patient with 

15 a oligonucleotide that selectively hybridizes to a 
CARD-encoding nucleic acid molecule. 



The methods of the invention for diagnosing 
cancer or monitoring cancer therapy using a CARD- 
specific antibody or oligonucleotide or nucleic acid 

2 0 that selectively hybridizes to a CARD-encoding nucleic 
acid molecule can be used, for example, to segregate 
patients into a high risk group or a low risk group for 
diagnosing cancer or predicting risk of metastasis or 
risk of failure to respond to therapy. Therefore, the 

2 5 methods of the invention can be advantageously used to 
determine, for example, the risk of metastasis in a 
cancer patient, or the risk of an autoimmune disease of 
a patient, or as a prognostic indicator of survival or 
disease progression in a cancer patient or patient with 

30 an autoimmune disease. One of ordinary skill in the 
art would appreciate that the prognostic indicators of 
survival for cancer patients suffering from stage I 
cancer can be different from those for cancer patients 
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suffering from stage IV cancer. For example, prognosis 
for stage I cancer patients can be oriented toward the 
likelihood of continued growth and/or metastasis of the 
cancer, whereas prognosis for stage IV cancer patients 
5 can be oriented toward the likely effectiveness of 
therapeutic methods for treating the cancer. 
Accordingly, the methods of the invention directed to 
measuring the level of or determining the presence of a 
CARD- containing polypeptide or CARD -encoding nucleic 
10 acid can be used advantageously as a prognostic 

indicator for the presence or progression of a cancer 
or response to therapy. 



The invention further provides methods for 
introducing a CARD-encoding nucleic acid into a cell in 

15 a subject, for example, for gene therapy. Viruses are 
specialized infectious agents that can elude host 
defense mechanisms and can infect and propagate in 
specific cell types. Viral based systems provide the 
advantage of being able to introduce relatively high 

2 0 levels of the heterologous nucleic acid into a variety 
of cells. Suitable viral vectors for introducing an 
invention CARD-encoding nucleic acid into mammalian 
cells (e.g., vascular tissue segments) are well known 
in the art. These viral vectors include, for example, 

25 Herpes simplex virus vectors (e.g., Geller et al . , 

Science , 241:1667-1669 (1988)), Vaccinia virus vectors 
(e.g., Piccini et al . , Meth. in Enzymology , 153:545-563 
(1987); Cytomegalovirus vectors (Mocarski et al . , in 
Viral Vectors , Y. Gluzman and S.H. Hughes, Eds., Cold 

30 Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 
1988, pp. 78-84), Moloney murine leukemia virus 
vectors (Danos et_al . , Proc. Natl. Acad. Sci., USA , 
85:6469 (1980)), adenovirus vectors (e.g., Logan et 
al., Proc. Natl. Acad. Sci., USA , 81:3655-3659 (1984); 
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Jones et al . , Cell . 17:683-689 (1979); Berkner, 
Biotechniaues . 6:616-626 (1988); Cotten et al . , Proc . 
Natl. Acad. Sci. . USA. 89:6094-6098 (1992); Graham et 
al./ Meth. Mol . Biol . . 7:109-127 (1991)), 
5 adeno-associated virus vectors, retrovirus vectors 

(see, e.g., U.S. Patent 4,405,712 and 4 , 650 , 764 ) , and 
the like. Especially preferred viral vectors are the 
adenovirus and retroviral vectors. 



Suitable retroviral vectors for use herein 
10 are described, for example, in U.S. Patent 5,252,479, 
and in WIPO publications WO 92/07573, WO 90/06997, WO 
89/05345, WO 92/05266 and WO 92/14829, incorporated 
herein by reference, which provide a description of 
methods for efficiently introducing nucleic acids into 
15 human cells using such retroviral vectors. Other 
retroviral vectors include, for example, the mouse 
mammary tumor virus vectors (e.g., Shackleford et al . , 
Proc. Natl. Acad . Sci. USA . 85:9655-9659 (1988)), and 
the like. 



20 In particular, the specificity of viral 

vectors for particular cell types can be utilized to 
target predetermined cell types. Thus, the selection 
of a viral vector will depend, in part, on the cell 
type to be targeted. For example, if a 

2 5 neurodegenerative disease is to be treated by 

increasing the level of a CARD- containing polypeptide 
in neuronal cells affected by the disease, then a viral 
vector that targets neuronal cells can be used. A 
vector derived from a herpes simplex virus is an 

3 0 example of a viral vector that targets neuronal cells 

(Battleman et al., J. Neurosci. 13:941-951 (1993), 
which is incorporated herein by reference) . Similarly, 
if a disease or pathological condition of the 
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hematopoietic system is to be treated, then a viral 
vector that is specific for a particular blood cell or 
its precursor cell can be used. A vector based on a 
human immunodeficiency virus is an example of such a 
5 viral vector (Carroll et al . , J. Cell, Biochem. 17E:241 
(1993), which is incorporated herein by reference). In 
addition, a viral vector or other vector can be 
constructed to express a CARD-encoding nucleic acid in 
a tissue specific manner by incorporating a 
10 tissue-specific promoter or enhancer into the vector 

(Dai et al . , Proc . Natl. Acad. Sci . USA 89:10892-10895 
(1992) , which is incorporated herein by reference) . 



For gene therapy, a vector containing a CARD- 
encoding nucleic acid or an antisense nucleotide 

15 sequence can be administered to a subject by various 
methods. For example, if viral vectors are used, 
administration can take advantage of the target 
specificity of the vectors. In such cases, there in no 
need to administer the vector locally at the diseased 

20 site. However, local administration can be a 

particularly effective method of administering a CARD- 
encoding nucleic acid. In addition, administration can 
be via intravenous or subcutaneous injection into the 
subject. Following injection, the viral vectors will 

25 circulate until they recognize host cells with the 
appropriate target specificity for infection. 
Injection of viral vectors into the spinal fluid also 
can be an effective mode of administration, for 
example, in treating a neurodegenerative disease. 



Receptor-mediated DNA delivery approaches 
also can be used to deliver a CARD-encoding nucleic - 
acid molecule into cells in a tissue-specific manner 
using a tissue-specific ligand or an antibody that is 
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non-covalently complexed with the nucleic acid molecule 
via a bridging molecule (Curiel et al . , Hum. Gene Ther. 
3:147-154 (1992); Wu and Wu, J. Biol. Chettu 
262:4429-4432 (1987), each of which is incorporated 
5 herein by reference) . Direct injection of a naked or a 
nucleic acid molecule encapsulated, for example, in 
cationic liposomes also can be used for stable gene 
transfer into non-dividing or dividing cells in vivo 
(Ulmer et al . , Science 259:1745-1748 (1993), which is 

10 incorporated herein by reference) . In addition, a 

CARD-encoding nucleic acid molecule can be transferred 
into a variety of tissues using the particle 
bombardment method (Williams et al . , Proc. Natl. Acad. 
Sci. USA 88:2726-2730 (1991), which is incorporated 

15 herein by reference) . Such nucleic acid molecules can 
be linked to the appropriate nucleotide sequences 
required for transcription and translation. 



A particularly useful mode of administration 
of a CARD-encoding nucleic acid is by direct 

2 0 inoculation locally at the site of the disease or 

pathological condition. Local administration can be 
advantageous because there is no dilution effect and, 
therefore, the likelihood that a majority of the 
targeted cells will be contacted with the nucleic acid 
25 molecule is increased. Thus, local inoculation can 
alleviate the targeting requirement necessary with 
other forms of administration and, if desired, a vector 
that infects all cell types in the inoculated area can 
be used. If expression is desired in only a specific 

3 0 subset of cells within the inoculated area, then a 

promoter, an enhancer or other expression element 
specific for the desired subset of cells can be linked 
to the nucleic acid molecule. Vectors containing such 
nucleic acid molecules and regulatory elements can be 
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viral vectors, viral genomes, plasmids, phagemids and 
the like. Transfection vehicles such as liposomes also 
can be used to introduce a non-viral vector into 
recipient cells. Such vehicles are well known in the 
5 art . 

The present invention also provides 
therapeutic compositions useful for practicing the 
therapeutic methods described herein. Therapeutic 
compositions of the present invention, such as 

10 pharmaceutical compositions, contain a physiologically 
compatible carrier together with an invention CARD- 
containing polypeptide (or functional fragment 
thereof) , an invention CARD- encoding nucleic acid, an 
agent that alters CARD activity or expression 

15 identified by the methods described herein, or an 

anti-CARD antibody, as described herein, dissolved or 
dispersed therein as an active ingredient. In a 
preferred embodiment, the therapeutic composition is 
not immunogenic when administered to a mammal or human 

20 patient for therapeutic purposes. 

As used herein, the terms "pharmaceutically 
acceptable" , "physiologically compatible" and 
grammatical variations thereof, as they refer to 
compositions, carriers, diluents and reagents, are used 
2 5 interchangeably and represent that the materials are 
capable of administration to a mammal without the 
production of undesirable physiological effects. 

The preparation of a pharmacological 
composition that contains active ingredients dissolved 
30 or dispersed therein is well known in the art. 

Typically such compositions are prepared as injectibles 
either as liquid solutions or suspensions; however, 
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solid forms suitable for solution, or suspension, in 
liquid prior to use can also be prepared. The 
preparation can also be emulsified. 

The active ingredient can be mixed with 
5 excipients which are pharmaceutically acceptable and 
compatible with the active ingredient in amounts 
suitable for use in the therapeutic methods described 
herein. Suitable excipients are, for example, water, 
saline, dextrose, glycerol, ethanol, or the like, as 

10 well as combinations of any two or more thereof. In 

addition, if desired, the composition can contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and the like, 
which enhance the effectiveness of the active 

15 ingredient. 

The therapeutic composition of the present 
invention can include pharmaceutically acceptable salts 
of the components therein. Pharmaceutically acceptable 
nontoxic salts include the acid addition salts (formed 

2 0 with the free amino groups of the polypeptide) that are 
formed with inorganic acids such as, for example, 
hydrochloric acid, hydrobromic acid, perchloric acid, 
nitric acid, thiocyanic acid, sulfuric acid, phosphoric 
acid, acetic acid, propionic acid, glycolic acid, 

25 lactic acid, pyruvic acid, oxalic acid, malonic acid, 
succinic acid, maleic acid, fumaric acid, anthranilic 
acid, cinnamic acid, naphthalene sulfonic acid, 
sulfanilic acid, and the like. 

Salts formed with the free carboxyl groups 
30 can also be derived from inorganic bases such as, for 
example, sodium hydroxide, ammonium hydroxide, 
potassium hydroxide, and the like; and organic bases 
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such as mono-, di-, and tri-alkyl and -aryl amines 
(e.g., triethylamine, diisopropyl amine, methyl amine, 
dimethyl amine, and the like) and optionally 
substituted ethanolamines (e.g., ethanolamine , 
5 diethanolamine, and the like) . 

Physiologically tolerable carriers are well 
known in the art. Exemplary liquid carriers are 
sterile aqueous solutions that contain no materials in 
addition to the active ingredients and water, or 

10 contain a buffer such as sodium phosphate at 

physiological pH, physiological saline or both, such as 
phosphate-buf f ered saline. Still further, aqueous 
carriers can contain more than one buffer salt, as well 
as salts such as sodium and potassium chlorides, 

15 dextrose, polyethylene glycol and other solutes. 

Liquid compositions can also contain liquid 
phases in addition to and to the exclusion of water. 
Exemplary additional liquid phases include glycerin, 
vegetable oils such as cottonseed oil, and water-oil 
20 emulsions. 

As described herein, an "effective amount' 7 is 
a predetermined amount calculated to achieve the 
desired therapeutic effect, i.e., to alter the protein 
binding activity of a CARD- containing polypeptide or 

25 the catalytic activity of a CARD- containing 

polypeptide, resulting in altered biochemical process 
modulated by a CARD -containing polypeptide. The 
required dosage will vary with the particular treatment 
and with the duration of desired treatment; however, it 

30 is anticipated that dosages between about 10 micrograms 
and about 1 milligram per kilogram of body weight per 
day will be used for therapeutic treatment. It may be 
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particularly advantageous to administer such agents in 
depot or long- lasting form as discussed herein. A 
therapeutically effective amount is typically an amount 
of an agent identified herein that, when administered 
5 in a physiologically acceptable composition, is 

sufficient to achieve a plasma concentration of from 
about 0.1 ug/ml to about 100 yg/ml, preferably from 
about 1.0 ug/ml to about 50 ug/ml, more preferably at 
least about 2 ug/ml and usually 5 to 10 ug/ml. 
10 Therapeutic invention anti-CARD antibodies can be 

administered in proportionately appropriate amounts in 
accordance with known practices in this art. 

Also provided herein are methods of treating 
pathologies characterized by abnormal cell 
15 proliferation, abnormal cell death, or inflammation 
said method comprising administering an effective 
amount of an invention therapeutic composition. Such 
compositions are typically administered in a 
physiologically compatible composition. 



2 0 Exemplary abnormal cell proliferation 

diseases associated with CARD- containing polypeptides 
contemplated herein for treatment according to the 
present invention include cancer pathologies, 
keratinocyte hyperplasia, neoplasia, keloid, benign 
25 prostatic hypertrophy, inflammatory hyperplasia, 

fibrosis, smooth muscle cell proliferation in arteries 
following balloon angioplasty (restenosis) , and the 
like. Exemplary cancer pathologies contemplated herein 
for treatment include, gliomas, carcinomas, 

3 0 adenocarcinomas, sarcomas, melanomas, hamartomas, 

leukemias, lymphomas, and the like. Further diseases 
associated with CARD -containing polypeptides 
contemplated herein for treatment according to the 
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present invention include inflammatory diseases and 
diseases of cell loss. Such diseases include 
allergies, inflammatory diseases including arthritis, 
lupus, Schrogen's syndrome, Crohn's disease, ulcerative 
5 colitis, as well as allograft rejection, such as graft - 
versus-host disease, and the like. CARD-containing 
polypeptides can also be useful in design of strategies 
for preventing diseases related to abnormal cell death 
in conditions such as stroke, myocardial infarction, 
10 heart failure, neurodegenerative diseases such as 
Parkinson's and Alzheimer's diseases, and for 
immunodeficiency associated diseases such as HIV 
infection, HIV-related disease, and the like. 



Methods of treating pathologies can include 

15 methods of modulating the activity of one or more 
oncogenic proteins, wherein the oncogenic proteins 
specifically interact with a CARD- containing 
polypeptide of the invention. Methods of modulating 
the activity of such oncogenic proteins will include 

2 0 contacting the oncogenic protein with a substantially 
pure CARD-containing polypeptide or an active fragment 
(i.e. , oncogenic protein-binding fragment) thereof . 
This contacting will alter the activity of the 
oncogenic protein,, thereby providing a method of 

25 treating a pathology caused by the oncogenic protein. 

Further methods of modulating the activity of oncogenic 
proteins will include contacting the oncogenic protein 
with an agent, wherein the agent alters interaction 
between a CARD-containing polypeptide and an oncogenic 

30 protein. 



Also contemplated herein, are therapeutic 
methods using invention pharmaceutical compositions for 
the treatment of pathological disorders in which there 



WO 01/90156 PCT/US01/17158 

93 

is too little cell division, such as, for example, bone 
marrow aplasias, immunodeficiencies due to a decreased 
number of lymphocytes, and the like. Methods of 
treating a variety of inflammatory diseases with 
5 invention therapeutic compositions are also 

contemplated herein, such as treatment of sepsis, 
fibrosis (e.g., scarring), arthritis, graft versus host 
disease, and the like. 



The present invention also provides methods 

10 for diagnosing a pathology that is characterized by an 
increased or decreased level of a biochemical process 
to determine whether the increased or decreased level 
of the biochemical process is due, for example, to 
increased or decreased expression of a CARD- containing 

15 polypeptide or to expression of a variant CARD- 

containing polypeptide. As disclosed herein, such 
biochemical processes include apoptosis, NF-kB 
induction, cytokine processing, caspase-mediated 
proteolysis, transcription, inflammation, cell 

20 adhesion, and the like. The identification of such a 
pathology, which can be due to altered association of a 
CARD- containing polypeptide with a CARD-associated 
polypeptide in a cell, or altered ligand binding or 
catalytic activity of a CARD-containing polypeptide, 

25 can allow for intervention therapy using an effective 
agent or a nucleic acid molecule or an antisense 
nucleotide sequence as described herein. In general, a 
test sample can be obtained from a subject having a 
pathology characterized by having or suspected of 

30 having increased or decreased apoptosis and can be 

compared to a reference sample from a normal subject to 
determine whether a cell in the test sample has, for 
example, increased or decreased expression of a CARD- 
encoding gene. The level of a CARD- containing 
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polypeptide in a cell can be determined by contacting a 
sample with a reagent such as an ant i- CARD antibody or 
a CARD-associated polypeptide, either of which can 
specifically bind a CARD-containing polypeptide. For 
5 example, the level of a CARD-containing polypeptide in 
a cell can determined by well known immunoassay or 
immunohistochemical methods using an ant i - CARD antibody 
(see, for example, Reed et al . , Anal . Biochem. 205:70- 
76 (1992); see, also, Harlow and Lane, supra , (1988)). 

10 As used herein, the term "reagent" means a chemical or 
biological molecule that can specifically bind to a 
CARD-containing polypeptide or to a bound CARD/CARD- 
associated polypeptide complex. For example, either an 
anti-CARD antibody or a CARD-associated polypeptide can 

15 be a reagent for a CARD-containing polypeptide, whereas 
either an anti-CARD antibody or an ant i -CARD-associated 
polypeptide antibody can be a reagent for a CARD/CARD- 
associated polypeptide complex. 



As used herein, the term "test sample" means 

2 0 a cell or tissue specimen that is obtained from a 

subject and is to be examined for expression of a CARD- 
encoding gene in a cell in the sample. A test sample 
can be obtained, for example, during surgery or by 
needle biopsy and can be examined using the methods 
25 described herein to diagnose a pathology characterized 
by increased or decreased apoptosis. Increased or 
decreased expression of a CARD-encoding gene in a cell 
in a test sample can be determined, for example, by 
comparison to an expected normal level of CARD- 

3 0 containing polypeptide or CARD-encoding mRNA in a 

particular cell type. A normal range of CARD- 
containing polypeptide or CARD-encoding mRNA levels in 
various cell types can be determined by sampling a 
statistically significant number of normal subjects. 
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In addition, a reference sample can be evaluated in 
parallel with a test sample in order to determine 
whether a pathology characterized by increased or 
decreased apoptosis is due to increased or decreased 
5 expression of a CARD-encoding gene. The test sample 
can be examined using, for example, immunohistochemical 
methods as described above or the sample can be further 
processed and examined. For example, an extract of a 
test sample can be prepared and examined to determine 
10 whether a CARD- containing polypeptide in the sample can 
associate with a CARD-associated polypeptide in the 
same manner as a CARD- containing polypeptide from a 
reference cell or whether, instead, a variant CARD- 
containing polypeptide is expressed in the cell. 

In accordance with another embodiment of the 
present invention, there are provided diagnostic 
systems, preferably in kit form, comprising at least 
one invention CARD- encoding nucleic acid, CARD- 
containing polypeptide, and/or anti-CARD antibody 
described herein, in a suitable packaging material. In 
one embodiment, for example, the diagnostic nucleic 
acids are derived from any of SEQ ID NOS:ll, 187, 96, 
98, 100, 102, 85 and 89. Invention diagnostic systems 
are useful for assaying for the presence or absence of 
CARD-encoding nucleic acid in either genomic DNA or in 
transcribed CARD-encoding nucleic acid, such as mRNA or 
cDNA. 

A suitable diagnostic system includes at 
least one invention CARD-encoding nucleic acid, CARD- 
3 0 containing polypeptide, and/or anti-CARD antibody, 
preferably two or more invention nucleic acids, 
proteins and/or antibodies, as a separately packaged 
chemical reagent (s) in an amount sufficient for at 



15 
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least one assay. Instructions for use of the packaged 
reagent are also typically included. Those of skill in 
the art can readily incorporate invention nucleic acid 
probes and/or primers into kit form in combination with 
5 appropriate buffers and solutions for the practice of 
the invention methods as described herein. 



As employed herein, the phrase "packaging 
material" refers to one or more physical structures 
used to house the contents of the kit, such as 

10 invention nucleic acid probes or primers, and the like. 
The packaging material is constructed by well known 
methods, preferably to provide a sterile, 
contaminant- free environment. The packaging material 
has a label which indicates that the invention nucleic 

15 acids can be used for detecting a particular CARD- 
encoding sequence including the nucleotide sequences 
set forth in SEQ ID NOS:ll, 187, 96, 98, 100, 102, 85 
and 89 or mutations or deletions therein, thereby 
diagnosing the presence of, or a predisposition for a 

20 pathology such as cancer or an autoimmune disease. In 
addition, the packaging material contains instructions 
indicating how the materials within the kit are 
employed both to detect a particular sequence and 
diagnose the presence of, or a predisposition for a 

25 pathology such as cancer or an autoimmune disease. 

The packaging materials employed herein in 
relation to diagnostic systems are those customarily 
utilized in nucleic acid-based diagnostic systems. As 
used herein, the term "package" refers to a solid 
30 matrix or material such as glass, plastic, paper, foil, 
and the like, capable of holding within fixed limits an 
isolated nucleic acid, oligonucleotide, or primer of 
the present invention. Thus, for example, a package 
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can be a glass vial used to contain milligram 
quantities of a contemplated nucleic acid, 
oligonucleotide or primer, or it can be a microtiter 
plate well to which microgram quantities of a 
5 contemplated nucleic acid probe have been operatively 
affixed. 

"Instructions for use" typically include a 
tangible expression describing the reagent 
concentration or at least one assay method parameter, 
10 such as the relative amounts of reagent and sample to 
be admixed, maintenance time periods for reagent/sample 
admixtures, temperature, buffer conditions, and the 
like. 

A diagnostic assay should include a simple 
15 method for detecting the amount of a CARD -containing 
polypeptide or CARD-encoding nucleic acid in a sample 
that is bound to the reagent. Detection can be 
performed by labeling the reagent and detecting the 
presence of the label using well known methods (see, 
20 for example, Harlow and Lane, supra, 1988; chap. 9, for 
labeling an antibody) . A reagent can be labeled with 
various detectable moieties including a radiolabel, an 
enzyme, biotin or a f luorochrome . Materials for 
labeling the reagent can be included in the diagnostic 

2 5 kit or can be purchased separately from a commercial 

source. Following contact of a labeled reagent with a 
test sample and, if desired, a control sample, 
specifically bound reagent can be identified by 
detecting the particular moiety. 

3 0 A labeled antibody that can specifically bind 

the reagent also can be used to identify specific 
binding of an unlabeled reagent. For example, if the 
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reagent is an ant i -CARD antibody, a second antibody can 
be used to detect specific binding of the ant i -CARD 
antibody. A second antibody generally will be specific 
for the particular class of the first antibody. For 
5 example, if an ant i -CARD antibody is of the IgG class, 
a second antibody will be an anti-IgG antibody. Such 
second antibodies are readily available from commercial 
sources. The second antibody can be labeled using a 
detectable moiety as described above. When a sample is 
10 labeled using a second antibody, the sample is first 
contacted with a first antibody, then the sample is 
contacted with the labeled second antibody, which 
specifically binds to the first antibody and results in 
a labeled sample. 

15 In accordance with another embodiment of the 

invention, there are provided methods for determining a 
prognosis of disease free or overall survival in a 
patient suffering from cancer. For example, it is 
contemplated herein that abnormal levels of CARD- 

20 containing polypeptides (either higher or lower) in 
primary tumor tissue show a high correlation with 
either increased or decreased tumor recurrence or 
spread, and therefore indicates the likelihood of 
disease free or overall survival. Thus, the present 

25 invention advantageously provides a significant 
advancement in cancer management because early 
identification of patients at risk for tumor recurrence 
or spread will permit aggressive early treatment with 
significantly enhanced potential for survival. Also 

3 0 provided are methods for predicting the risk of tumor 
recurrence or spread in an individual having a cancer 
tumor; methods for screening a cancer patient to 
determine the risk of tumor metastasis; and methods for 
determining the proper course of treatment for a 



WO 01/90156 PCT/US01/17158 

99 

patient suffering from cancer. These methods are 
carried out by collecting a sample from a patient and 
comparing the level of CARD- encoding gene expression in 
the patient to the level of expression in a control or 
5 to a reference level of CARD- encoding gene expression 
as defined by patient population sampling, tissue 
culture analysis, or any other method known for 
determining reference levels for determination of 
disease prognosis. The level of CARD-encoding gene 
10 expression in the patient is then classified as higher 
than the reference level or lower than the reference 
level, wherein the prognosis of survival or tumor 
recurrence is different for patients with higher levels 
than the prognosis for patients with lower levels. 

15 All U.S. patents and all publications 

mentioned herein are incorporated in their entirety by 
reference thereto. The invention will now be described 
in greater detail by reference to the following 
non-limiting examples. 

20 EXAMPLES 

1.0 Identification of CARD- containing polypeptides. 

The process of gene identification and 
assembling include the following steps: 

A) Identification of new candidate CARD containing 
25 polypeptides. A database search was performed using 
the TBLASTN program with the CARD domain of caspase-1 
and caspase-12 as the query in the following MCBI 
databases : high throughput genome sequence (HTGS) , 
genomic survey sequence (GSS) and expressed sequence 
3 0 tag (EST) databases. 
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B) Verification that the new candidate CARD containing 
polypeptide is novel. Using PSI-BLAST, each new 
candidate CARD gene was queried in the annotated 

non- redundant (NR) database at NCBI . When the new 
5 candidate gene showed significant but not identical 

homology with other known CARD containing polypeptides 
during this search, the CARD containing polypeptide 
candidate was kept for further analysis. 

C) 3-D-Model Building of new candidate CARD 

10 polypeptide: When the sequence homology was low (<25% 
identity) , three-dimensional criteria was added to 
characterization of new CARD- containing polypeptides. 
The candidate CARD fragment was analyzed by a profile - 
profile sequence comparison method which aligns the 

15 candidate CARD domain with a database of sequences of 
known three-dimensional structure. From this analysis, 
a sequence alignment was produced and a three- 
dimensional model was built according to the known 
structure of CARD domain of IAP-1. In most cases, the 

2 0 best score was produced using CARD domain sequences 

having known three-dimensional structures. The quality 
of the three-dimensional model obtained from the 
alignments confirmed that novel CARD-domain containing 
polypeptides had been identified. 

25 

D) Identification of additional domains in the full 
length protein. Full length protein sequences were 
obtained using the closest full-length caspase homolog 
of the new CARD identified in step B as query. TBLASTN 

30 searches of the sequences containing the newly 

identified CARD domains were performed. Longer aligned 
fragments or multiple aligned fragments in the 
accession number corresponding to the newly identified 
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CARD containing polypeptides indicated a longer 
protein. 

E) These additional domains were assembled using the 
following gene building procedure: 



5 Genomic DNA fragments were identified by 

T-BLAST-N analysis using mouse caspase-12 and human 
caspase-1 full length protein as query and scanning 
HTGS database from NCBI of incomplete DNA genomics 
sequences. New fragments homologous to caspase-12 and 
10 caspase-1 were further confirmed by psi-blast analysis 
using the TBLASTN genomic DNA homolog fragment as query 
and scanning NR database. The boundary of each 
fragment was identified according to the following 
criteria : 



15 Disruption of sequence similarity between the 

protein alignment of the target fragment and the query. 

Extension of the protein sequence alignment 
between query and target using ORF finder. 

Protein sequence overlap between two 
20 contiguous fragments in sequence relative to the query. 

Conservation of exon-intron junction between 
DNA sequence of the target and query. 

Orientation of the ORF of the different 
genomic DNA fragment. 



Presence of contiguous fragments, based on 
sequence alignment with the query, on the same contig. 
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Finally, the reconstituted sequences were 
aligned by CLUSTALW with the query and exon-intron 
junctions further refined by repeating the above 
process . 



5 2.0 Identification of CARD2X, CARD 3 X and CLAN. 

Nucleic acids encoding CARD containing proteins CARD2X, 
CARD 3 X and CLAN were identified from different CARD 
queries using tblastn and systematically scanning gss, 
htgs, and all EST databases at NCBI . Further analysis 
10 using translated genomic fragment containing CARD 
domains larger than the CARD domain itself as query 
were performed to identify additional domains. Genomic 
DNA were translated in all reading frames and examined 
for additional domains using psi-blast and nr database. 



15 3.0 Cloning and sequencing of large cDNA . For 

cDNA larger than 1500 bp, cloning is accomplished by 
amplification of multiple fragments of the cDNA. 
Jurkat total RNA is reverse- transcribed to 
complementary DNAs using MMLV reverse transcriptase 

20 (Stratagene) and random hexanucleotide primers. 
Overlapping cDNA fragments of a CARD- containing 
polypeptide are amplified from the Jurkat complementary 
DNAs with Turbo Pfu DNA polymerase (Stratagene) using 
an oligonucleotide primer set for every 1500 bp of 

25 cDNA, where the amplified cDNA fragment contains a 
unique restriction site near the end that is to be 
ligated with an adjacent amplified cDNA fragment. 



The resultant cDNA fragments are ligated into 
mammalian expression vector pcDNA-myc (Invitrogen, 
30 modified as described in Roy et al . , EMBO J. 16:6914- 
6925 (1997) ) and assembled to full-length cDNA by 
consecutively ligating adjacent fragments at the unique 
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endonuclease sites form the full-length cDNA. 
Sequencing analysis of the assembled full-length cDNA 
is carried out, and splice isoforms of CARD- containing 
polypeptides can be identified. 



5 4.0 Plasmid Constructions. Complementary DNA 

encoding a CARD-containing polypeptide, or a functional 
fragment thereof is amplified from Jurkat cDNAs with 
Turbo Pfu DNA polymerase (Stratagene) and desired 
primers, such as those described above. The resultant 
10 PCR fragments are digested with restriction enzymes 
such as EcoRI and Xho I and ligated into pGEX-4Tl 
(Pharmacia) and pcDNA-myc vectors. 

5.0 In vitro Protein Binding Assays. CARD- 

containing or fragments thereof encoded in pGEX-4Tl are 

15 expressed in XL-1 blue E. coli cells (Stratagene) , and 
affinity-purified using glutathione (GSH) -sepharose 
according to known methods, such as those in Current 
Protocols in Molecular Biology , Ausubel et al . eds. , 
John Wiley and Sons (1999) . For GST pull -down assays, 

20 purified CARD-GST fusion proteins and GST alone 

(0.1-0.5 \ig immobilized on 10-15 ]il GSH- sepharose 
beads) are incubated with 1 mg/ml of BSA in 100 yl 
Co- IP buffer (142.4 mM KC1, 5mM M g Cl 2 , 10 mM HEPES (pH 
7.4), 0.5 mM EGTA, 0.2% NP-40, 1 mM DTT, and 1 mM PMSF) 

25 for 30 min. at room temperature. The beads are then 
incubated with 1 ]il of rat reticulocyte lysates 
(TnT-lysate; Promega, Inc.) containing 35 S-labeled, in 
vitro translated CARD-containing or control protein 
Skp-1 in 100 \il Co-IP buffer supplemented with 0.5 

3 0 mg/ml BSA for overnight at 4°C. The beads are washed 
four times in 500 \xl Co-IP buffer, followed by boiling 
in 20 \il Laemmli-SDS sample buffer. The eluted 
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proteins are analyzed by SDS-PAGE. The bands of 
SDS-PAGE gels are detected by f luorography . 

The resultant oligomerization pattern will 
reveal that CARD : CARD and other protein : protein 
5 interactions occur with CARD- containing polypeptides or 
fragments thereof . 



In vitro translated candidate CARD-associated 
polypeptides such as Apaf-1 (lacking its WD domain), 
CED4, and control Skp-1 are subjected to GST pull -down 

10 assay using GSH-sepharose beads conjugated with GST and 
GST-CARD-containing polypeptides as described above. 
Lanes containing GST -CARD yield significant signals 
when incubated with a CARD-associated polypeptide 
whereas, the controls GST alone and Skp-1 yield 

15 negligible signals. 



6-0 Protein Interaction Studies in Yeast, EGY4 8 

yeast cells (Saccharomyces cerevisiae: MATa, trpl, 
ura3, his, leu2 : : plexApo6 - leu2 ) are transformed with 
pGi Ida -CARD plasmids (his marker) encoding the LexA DNA 

2 0 binding domain fused to: CARD- containing polypeptides, 
fragments thereof, or CARD-associated polypeptides. 
EGY48 are also transformed with a LexA-LacZ reporter 
plasmid pSH184 0 (ura3 marker) , as previously described 
(Durfee et al . , 1993; Sato et al . , 1995). Sources for 

25 cells and plasmids are described previously in U.S. 

Patent 5,632,994, and in Zervous et al . , Cell 72:223- 
232 (1993); Gyuris et al . , Cell 75:791-803 (1993); 
Golemis et al . , In Current Protocols in Molecular 
Biology (ed. Ausubel et al . ; Green Publ . ; NY 1994), 

30 each of which is incorporated herein by reference. 
Transformants are replica-plated on Burkholder's 
minimal medium (BMM) plates supplemented with leucine 
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and 2% glucose as previously described (Sato et al . , 
Gene 140:291-292 (1994)). Protein-protein interactions 
are scored by growth of transf ormants on leucine 
deficient BMM plates containing 2% galactose and 1% 
5 raffinose. 



Protein-protein interactions are also 
evaluated using (3-galactosidase activity assays. 
Colonies grown on BMM/Leu/Glucose plates are 
filter-lifted onto nitrocellulose membranes, and 

10 incubated over-night on BMM/Leu/galactose plates. 
Yeast cells are lysed by soaking filters in liquid 
nitrogen and thawing at room temperature, 
p-galactosidase activity is measured by incubating the 
filter in 3.2 ml Z buffer (60 mM, Na 2 HP0 4 , 4 0 mM 

15 Na 2 HP0 4 , 10 mM KC1, 1 mM MgSOj supplemented with 50pl 
X-gal solution (20 mg/ml) . Levels of p-galactosidase 
activity are scaled according to the intensity of blue 
color generated for each transf ormant . 



The results of this experiment will show 
2 0 colonies on leucine deficient plates for yeast 
containing CARD/LexA fusions together with CARD- 
associated polypeptide/B42 . In addition, the 
CARD/LexA: CARD- associated polypeptide/B42 cells will 
have significant amounts of LacZ activity. 



25 7.0 Self -Association of NB-ARC domain of CARD- 

containing polypeptides. In vitro translated, 
35 S- labeled rat reticulocyte lysates (1 containing 
NB-ARC or Skp-l (used as a control) are incubated with 
GSH-sepharose beads conjugated with purified GST-NB-ARC 

30 or GST alone for GST pull -down assay, resolved on 

SDS-PAGE and visualized by fluorography as described 
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above. One tenth of input is loaded for NB-ARC or 
Skp-1 as controls. 



8.0 Protein-Protein Interactions of CARD- 

containing polypeptides. Transient transfection of 
5 293T, a human embryonic kidney fibroblast cell line, 
are conducted using SuperFect reagents (Qiagen) 
according to manufacturer's instructions. The cDNA 
fragments encoding full-length CED4 and the truncated 
form of Apaf-1 (Apaf-IAWD) comprising amino acids 1-42 0 

10 of the human Apaf-1 protein are amplified by PCR and 
subcloned into pcDNA3HA at EcoRI and Xho 1 sites. 
Expression plasmids encoding catalyt ically inactive 
forms of caspases such as pro-Casp8 (pro-Casp8 (C/A) ) 
are prepared by replacing Cys 3 77 with an Ala using 

15 site-directed mutagenesis and pro-Casp9 (pro~Casp9 

(C/A) ) has been described previously, Cardone et al . , 
Science 282:1318-1321 (1998)). 293T cells are 
transiently transfected with an expression plasmid (2 
lag) encoding HA-tagged human Apaf-IAWD, CED4 , pro-Casp8 

20 (C/A) or C-Terminal Flag-tagged pro-Casp9 (C/A) in the 
presence or absence of a plasmid (2 jig) encoding myc- 
tagged CARD -containing polypeptide. After 24 hr growth 
in culture, transfected cells are collected and lysed 
in Co-IP buffer (142.4 mM KC1, 5 mM MgCl 2/ 10 mM HEPES 

25 (pH 7.4) , 0.5 mM EGTA, 0.1 % NP-40, and 1 mM DTT) 

supplemented with 12 . 5 mM P -glycerophosphate , 2 mM 
NaF, 1 mM Na 3 V0 4/ 1 mM PMSF, and IX protenase inhibitor 
mix (Boehringer Mannheim) . Cell lysates are clarified 
by microcentrif ugation and subjected to 

3 0 immunoprecipitation using either a mouse monoclonal 

antibody to myc (Santa Cruz Biotechnologies, Inc) or a 
control mouse IgG. Proteins from the immune complexes 
are resolved by SDS-PAGE, transferred to nitrocellulose 
membranes, and subjected to immunoblot analysis using 
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anti-HA antibodies followed by anti-myc antibodies 
using a standard Western blotting procedure and ECL 
reagents from Amer sham- Pharmacia Biotechnologies, Inc. 
(Krajewski et al . , Proc. Natl. Acad. Sci . USA 96:5752- 
5 5757 (1999) ) . 



9.0 Cloning and characterization of CARD2X. CARD2X- 
encoding cDNA was obtained by PCR using primers 
CGGAATTCATGGCTACCGAGAGTACTCC (SEQ ID NO: 76) and 
GTAAAACGACGGCCAGT (SEQ ID NO: 77) to amplify a 0.9 kb 

10 cDNA molecule from a human skeletal muscle cDNA library 
(Clontech) . The PCR products was then purified by 
agarose gel electrophoresis and the purified products 
subcloned into pBluescript II SK vector (Stratagene) . 
Using the forward primers, the PCR fragments were 

15 directly sequenced using the ABI PRISM Big Dye Terminal 
Cycle sequencing kit, according to manufacturer's 
instructions (Perkin Elmer) . Based on the sequence 
obtained, a third CARD2X- specif ic primer was generated 
having the sequence GCAGAAGCCACTGTGGAAGAGGAGGTT (SEQ ID 

20 NO:78) . In identifying the 3 ' end of the CARD2X- 

encoding cDNA, this third CARD2X- specif ic primer was 
used in conjunction with a phage- specific primer having 
the sequence ATACGACTCACTATAGGGCGAATTGGCC (SEQ ID 
NO: 79) to amplify a 0 . 3 kb cDNA molecule using methods 

25 described above. The 0.3 kb cDNA molecule was cloned 
and sequenced as described above, and the sequences of 
the 0.3 and 0 . 9 kb cDNA molecules were merged to 
produce a 1.0 kb cDNA sequence. 



The sequence of CARD2X was confirmed. 
30 Additional 5' untranslated sequence was identified 
(nucleotide sequence of CARD2X including 5 1 
untranslated sequence, SEQ ID NO: 84). The CARD domain 
extends from amino acids 4 to 78 of SEQ ID NO: 12. 
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The association between CARD 2 X and other 
CARD- containing proteins was determined. HEK 293T 
cells in 6-well plates were transfected using SuperFect 
(Qiagen) with pairwise combinations of Myc-tagged or 
5 FLAG- tagged CARD2X, CARDIAK or NODI (total DNA 2)ig) . 

After 24 hours, cells were collected in 400 ]il of lysis 
buffer (20mM.Tris, pH 7.4, 150mM NaCl , 1% NP-40, and 
ImM EDTA supplemented with lx protease inhibitors mix 
(Roche/Boehringer Mannheim) ) . Cell lysates were 

10 clarified by centrif ugation and subjected to 

immunoprecipitation using Agarose-beads conjugated with 
anti-FLAG M2 antibody (Sigma) . Immune- complexes were 
washed three times with wash buffer (20mM Tris, pH 7.4, 
lOOmM NaCl, 0.05% NP-40, and ImM EDTA), and resolved on 

15 SDS-PAGE gels. Proteins in the gels were transferred 
to nitrocellulose membranes, immunoblotted with anti- 
Myc antibodies, and detected with ECL 
(Amersham- Pharmacia Biotech). Epitope-specif ic 
antibodies for myc, FLAG, or HA tag were obtained from 

2 0 Santa Cruz Biotech, Roche/Boehringer Mannheim, and 
Sigma. The results of these co- immunoprecipitation 
assays demonstrated that CARD2X specifically associates 
with both NODI and with CARDIAK. 



The effect of CARDIAK on CARD2X 
25 phosphorylation was next determined. HEK 293T cells 
transiently expressing FLAG- CARDIAK were lysed and 
immunoprecipitated with Agarose-beads conjugated with 
anti-FLAG M2 antibody. In vitro phosphorylation was 
performed in the immune complex with or without 
3 0 purified Myc-CARD-2X as a substrate. The kinase 

reaction was initiated by adding l\iM of [y- 32 P]ATP in 
10]Xl of kinase buffer (50mM Tris, pH7.4, lOOmM NaCl, 
6mM MgCl 2 , ImM MnCl, and ImM EDTA) . After 20min at 
37°C / the reaction was stopped by adding lOyl of 2x SDS 
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sample buffer, and subjected to SDS-PAGE and 
autoradiography. The results of these assays indicated 
that CARD 2 X is not phosphorylated directly by CARDIAK. 

Phosphatase assays were also performed to 
5 examine phosphorylation of CARD2X. HEK 293 cells were 
transfected with plasmids encoding Myc-CARD-2X with or 
without FLAG - GARDIAK or FLAG-CARDIAK (K4 7M) , which is a 
kinase deficient mutant of CARDIAK. The cleared 
lysates were diluted 1:20 with 20p.l of reaction buffer 

10 (25mM Tris, pH8.0, 50mM MaCl , 5mM MgCl 2 ) , and 

optionally treated with 2 units of calf intestine 
alkaline phosphatase (Gibco BRL) for 3 0min at 3 7°C. 
The reaction was terminated by adding 4x SDS sample 

buffer, and subjected to SDS-PAGE and immunoblot. The 

15 phosphorylated form of CARD 2 X migrates more slowly that 
CARD2X, and is not observed after phosphatase 
treatment. The results of these assays indicated that 
CARD2X is phosphorylated in vivo in the presence of 
either CARDIAK or kinase-def icient CARDIAK, but not in 

20 their absence. Taken together with the in vitro 

phosphorylation results above, these results indicate 
that CARDIAK is indirectly involved in CARD2X 
phosphorylation . 

The 30-35 residues at the carboxy terminus of 
25 CARD2X have homology to human Alu family sequences and 
RhoGAP. Thus, this region can have activity similar to 
that observed in human Alu family sequences and RhoGAP. 

10.0 Cloning and characterization of CLAN. CLAN 
encoding cDNA was obtained by polymerase chain reaction 
30 (PCR) using primers CXF1 : TACTTACTTTGTCCCTTCA (SEQ ID 
NO: 74) and CXR2 : TATTTGTCCCCATCTCGTC (SEQ ID NO: 75) to 
amplify cDNA from a human genomic library. Thirty 
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cycles of PCR were carried out using Turbo Pfu DNA 
polymerase (Stratagene) at annealing temperature 47 °C 
and extension temperature 72 °C. The PCR product was 
then purified by agarose gel electrophoresis and the 
5 purified product subcloned into pGEM-T vector 
(Promega) . 



The HTSG database of human genomic DNA 
sequence data was searched for regions capable of 
encoding CARDs using the CARD amino-acid sequence of 

10 cIAP-1 as a query with the TBLASTn method. This search 
revealed strong homology with a human genomic clone 
(Accession number: AQ889169) that mapped to human 
chromosome 2p21-22. This locus was not recognized in 
the human genomic database and was not previously 

15 annotated. In initial studies, two genes encoding CARD 
domain containing polypeptides, designated CARD4X and 
CARD5X, were identified. Upon further 

characterization, it was determined that CARD4X (also 
known as NAC-X or NAC-4) and CARD 5 X were actually 

2 0 encoded by the same gene, which is therefore referenced 
as CARD4/5X. CARD4/5X was subsequently designated 
CLAN, which stands for "CARD, LRR and NACHT-containing 
protein," because at least one of the proteins encoded 
by it contains CARD, Leucine Rich Repeat (LRR) and 

2 5 NACHT (NB-ARC) domains, as described below. 



The CLAN gene locus lies in close proximity 
to the gene encoding Spastin (on chromosome 2p21-22) , a 
AAA protein which is frequently mutated in autosomal 
dominant hereditary spastic paraplegia (AD-HSP) . The 
3 0 CLAN locus is found on the strand opposite the SPG4 

(SPAST) locus but with no overlapping regions. This 
result suggests that mutations in the CLAN gene 
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potentially occur in patients with this 
neurodegenerative disorder. 



Using GENE SCAN for exon prediction, 
additional regions potentially encoding a NACHT (NB- 
5 ARC) domain and regions corresponding to Leucine-Rich 
Repeat (LRR) domains were also recognized 3' to the 
potential CARD-encoding sequences, suggesting the 
presence of a CED4-like gene. 



10.1 Cloning of CLAN cDNAs . CLAN- specific primers 

10 corresponding to sequences within the putative CARD and 
NACHT (NB-ARC) regions (as determined from genomic DNA 
sequence data) were used in conjunction with 2 
universal primers to isolate CLAN cDNAs from first - 
strand liver and lung cDNA by nested PCR according to 

15 the manufacturer's protocol (SMART RACE, Clontech) . 

Primers used for amplification are 5' RACE primers (5'- 
CATGTGAATGATCCCTCTAGCAG-3' (SEQ ID NO: 153); nested 5'-. 
GGGCTCGGCTATCGTGCTCTA-3 ' (SEQ ID NO: 154)) and 3' RACE 
primers ( 5 ' - ACGATAGCCGAGCCCTTATTC- 3 ' (SEQ ID NO: 155); 

20 nested 5 ' -GTATGGAATGTTCTGAATCGC- 3 ' (SEQ ID KO;156)). 

Amplification products were purified from agarose gels, 
ligated into the TA cloning vector (Promega) , and 
sequenced. Four open reading frames were deduced and 
multiple clones of each isoform were sequenced to 

25 ensure fidelity of PCR products. 



The longest transcript, termed CLAN- A, was 
3.370 kilobasepairs (kbp) in length (SEQ ID NO: 96) with 
an open reading frame (ORF) coding for a 1024 amino- 
acid protein (SEQ ID NO: 97) containing a CARD, NACHT 
30 (NB-ARC) , and LRR-domains, as well as a predicted SAM 
domain. A second transcript, termed CLAN-B, was 1.374 
kbp in length (SEQ ID NO: 98), with an ORF coding for a 
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359 amino-acid protein (SEQ ID NO: 99) containing an 
identical CARD directly spliced to the LRRs . CLAN-C, 
the third transcript isolated, was 0.768 kbp in length 
(SEQ ID NO: 102) and encoded a 156 amino acid protein 
5 (SEQ ID NO: 103) containing the CARD and an additional 
region lacking homology to recognizable domains. 
Finally, the shortest transcript found, CLAN-D, was 
0.578 kbp in length (SEQ ID NO: 100) and contained an 
ORP encoding a 92 amino-acid protein (SEQ ID NO: 101) 
10 encompassing only the CARD followed by 9 amino acids. 

Comparisons of these cDNA sequence data with 
the genomic DNA sequence data found in the HTSG 
database suggested that the CLAN gene consists of 12 
exons, spanning 41.3 kbp on chromosome 2p21-22 (Figure 
1A) . Six differences were found between the sequence 
of the CLAN cDNA and the sequence within the public 
database. Additionally, nucleotide regions 1-12 and 
3372-3396 do not have equivalent fragments in the 
public database. 

Southern blot analysis was also performed. 
For Southern blot analysis, 10 ]ig of restriction 
endonuclease (EcoRI or PstI) digested genomic DNA was 
loaded per lane and hybridized with the CARD domain of 
CLAN as a probe. The probe was derived from the CLAN 
A-isoform (see Figures 1 and 2), nucleotides 276 to 507 
plus an additional 2 0 upstream nucleotides, which are 
not present in the cDNA but are present in the genomic 
DNA. CLAN was found to be a single copy gene. 

Two different transcriptional start sites are 
3 0 utilized (corresponding to the beginning of either exon 
1 or 2) ; however both are spliced to exon 3 at the 
beginning of the CARD. Exons 6 and 7 contain 
additional internal splice donor sites which are 



15 



20 



25 
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utilized to generate CLAN-G. Figure IB shows the 
pattern of mRNA splicing events predicted to give rise 
to the CLAN-A, CLAN-B, CLAN-C, and CLAN-D transcripts 
and encoded proteins. All the exon/intron splice 
5 junctions follow the conserved GT/AG consensus rule. 



As predicted by SMART (EMBL, Heidelberg, 
Germany) , CLAN contains a CARD (amino acids 1-87 of SEQ 
ID NO: 97) . A ty-BLAST search of the non-redundant 
database using the CLAN CARD as query identified 
10 several homologous CARDs including those from cIAPl and 
2 (58%), caspase-1 and ICEBERG (50%), Nodi, Nod2, and 
Card8 (-38%) and caspase-13, Ced3 , caspase-9, Bel 10 
(CIPER) and CARKIAK/RIP2 (-30%) . 



Following the CARD, a domain containing 
15 consensus sequences for Walker A and B boxes is present 
(Walker et al . , EMBO J. 8:945-951 (1982)) as well as 
additional characteristics of the family of NTPases 
termed the NACHT family (Koonin et al . , Trends . 
Biochem. Sci . 25:2230224 (2000)). By Tjf-BLAST search 
2 0 the NACHT domain of CLAN ( "NB" in Figure 1, amino acids 
161-457 of SEQ ID NO: 97) shows highest similarity to 
the NACHT domain of NAIP (60%), followed by Nodi (49%) 
and Nod2 (47%) . 

Leucine Rich Repeat (LRR) domains are also 
25 found near the C-terminus of the A and B isoforms of 
the protein. The C- terminal end consists of four 
repeated LRRs, each containing a predicted (3 sheet and 
a helical structure, which is in agreement with the 
prototypical horseshoe -shaped structure of LRRs (Kobe 
30 et. al., Curr. Opin. Struct. Biol . 5:409-416 (1999).. 
LRR 1 (amino acids 760-791 of SEQ ID NO: 97) represents 
a non-Kobe and Deisenhofer (non-K/D) LRR, whereas LRRs 
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2, 3, and 4 (amino acids 817-848; 845-876; and 934-965 
of SEQ ID NO: 97, respectively) are in accordance with 
Kobe and Deisenhofer (K/D) LRR . LRR 2 also shares 
sequence homology to a prototypical Ribonuclease 
5 Inhibitor type A (RI type A) . By BLAST searches the 
LRRs show 4 9% sequence identity to the placental 
ribonuclease/angiogenin inhibitor (RAI) . 



Sequences located between the NACHT (NB-ARC) 
and LRR domains show some similarity to the sterile 
10 alpha motif (SAM) (amino acids 642-696 of SEQ ID 

NO: 97), a domain built of five alpha helices originally 
found in proteins involved in numerous developmental 
processes. The SAM domain has been shown to function 
as a protein-protein interaction domain, with ability 
15 to homo-as well as hetero-oligomerize with other SAMs 
(Stapleton et al . , Nat. Struct. Biol . 6:44-49 (1999)). 



10.2 In vivo expression of CLAN. In order to 

determine which of the various splice variants of CLAN 
are expressed in adult human tissues, Northern blot 

2 0 analysis was performed. Hybridization probes 

corresponding to the common CARD domain of all 4 CLAN 
isoforms or the NACHT and LRR regions were radiolabeled 
by random priming with hexanucleotides (Roche) and a- 
32 P-dCTP, or Digoxigenin- labeled with a commercially 
25 available kit (Roche) , incubated with blots containing 
human poly (A) + RNA derived from various human tissues 
(Origene) , washed at high stringency, and exposed to X- 
ray film. Positive signals were detected by 
autoradiography or by immunoblotting with HRP- 

3 0 conjugated ant i- DIG antibody and an enhanced 

chemiluminescence method (ECL) (Amersham) . 
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Northern blot analysis with CARD of CLAN 
revealed expression of an approximately 1.5 kbp 
transcript corresponding to CLAN-B in nearly all 
tissues examined, with highest expression in lung and 
5 spleen. Northern blot analysis using, the NACHT and LRR 
of CLAN-A as a probe revealed expression of an 
approximately 3 . 5 kbp mRNA corresponding to CLAN-A 
primarily in the lung. 



To further explore the tissue-specific 
10 patterns of expression of CLAN splicing variants, RT- 

PCR assays were devised specific for the A, B, C, and D 
isof orms . A panel of cDNA specimens derived from 
various human tissues was utilized (Clontech) , as well 
as blood cells, prepared as followed. Peripheral blood 
15 leukocytes were obtained from heparinized venous blood 
by Ficoll-Paque (Amersham) density-gradient ' 
centrif ugation. Red blood cells were removed from 
granulocytes by short incubation in hypotonic lysis 
buffer. Monocytes were separated from lymphocytes by 
20 adherence to plastic dishes. Total RNA was isolated 

from cells using TRIZOL reagent (BRL) and 2 ug was used 
to generate cDNA in a reverse transcription reaction 
with Superscript II (BRL) . 



PCR was carried out on the cDNA samples in an 
25 Eppendorf thermal cycler using Taq polymerase (BRL) and 
the following isof orm-specif ic primer pairs: CLAN-A 5'- 
GGTGGAGCAGGATGCTGCTAGAGG-3 ' (SEQ ID NO: 159), 5'- 
CACAGTGGTCCAGGCTCCGAATGAAGTCA-3 ' (SEQ ID NO: ISO); CLAN- 
B 5 ' -CATCATTTGCTGCGAGAAGGTGGAG- 3 ' (SEQ ID NO: 161), 5'- 
30 TTAACTTGGATAACACTTGGCTAAG-3' (SEQ ID NO: 162); CLAN-C 
5 ' - GTAAACATCATTTGCTGCGAGAA - 3 ' (SEQ ID NO : 163 ) , 5 ' - 
CCCGGGCAGGTAGAAGATGCTAT-3 ' (SEQ ID NO: 164); CLAN-D 
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5 ' AATTTCATAAAGGACAATAGCCGAG - 3' (SEQ ID NO : 165) , 5 ' - 
TGTCTACTGTACTTTCTAAGCTGTT-3 ' (SEQ ID NO: 166), 



RT-PCR analysis showed that CLAN-B was 
5 present throughout human tissues (brain, heart, kidney, 
liver, lung, pancreas, placenta, skeletal muscle, 
colon, ovary, leukocytes, prostate, small intestine, 
spleen, testis, thymus) , consistent with the Northern 
blot analysis. In contrast, CLAN-A was restricted to 

10 lung, colon, brain, prostate, spleen and leukocytes, 
but not other tissues. Further analysis of leukocyte 
sub-populations revealed expression of the CLAN-A 
isoform predominantly in the monocyte cell fraction, 
with lower expression found in granulocytes and no 

15 expression in lymphocytes. Expression of CLAN-C was 
absent in all normal tissues tested, however, 
expression was evident in the cell line HEK293T, 
suggesting this transcript can be produced under some 
circumstances. CLAN-D transcripts were detected only in 

2 0 brain by RT-PCR . 



RT-PCR was also performed on cell lines. 
RT-PCR was performed using the same CLAN primers as 
used for RT-PCR in normal tissues, as described above. 
RT-PCR was performed in various tumor derived cell 

2 5 lines: M2 , OVCAR3 , HEY, HaCaT, 2 93T, SKOV-3, Jurkat, 

BG-1, 697, HL-60, PC3 , DU145, MDA-MB-231, MCF-7, MDA- 
MB-4, HS578T, BT-549, and T-47D. Beta-actin primers 
were used as a control. In contrast to normal tissue, 
the transcript for CLAN was mostly absent in the cell 

3 0 lines tested. Weak expression was found in the cell 

lines 697, MDA-MB-231, MVF-7, MDA-MB-4, HS578T, and T- 
47D. 
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10.3 CLAN protein interactions. Interactions 

between the CARD of CLAN and known CARD domains were 
tested in vitro and in vivo. 



To test CLAN interactions with other 
5 molecules, an in vitro binding assay was performed. 
CLAN was in vitro translated in the absence of label 
(i.e., cold). Other cellular proteins were labeled in 
vitro with 35 S-Met: CLAN, caspasel, caspase2, caspase8, 
caspase9, caspaselO, Apafl, Apaf 1-CARD, NACa, NAC-CARD, 
10 BcllO, ASC, cIAPl, cIAP2, XIAP, Nodi, Ced4 , RAIDD, and 
CARDIAK. The in vitro translated proteins were mixed 
separately with unlabeled CLAN and 

co-immunoprecipitated using an antibody against an 
epitope tag fused to CARD5X, either myc or hemaglutinin 
15 (HA) . CLAN associated proteins were eluted by boiling 
in Laemmli denaturing buffer and separated by 12% SDS- 
PAGE. The radioactive bands were visualized by 
f luorography . 

Weak binding to CLAN was observed with 
20 caspase2 and cIAPl, with stronger binding to Nodi and 
Cardiak. The strongest binding was observed with Ced4. 
Caspase8 binding is possibly due to its stickiness. 
There was no association detected between CLAN and 
itself . 



25 To prepare appropriate expression vectors for 

in vivo interaction studies, a cDNA encoding the CLAJ\T 
CARD domain was amplified using PFU polymerase and 
specific primers ( 5 ' - CCCGGATCCATGAATTTCATAAAGGACAATAGC - 
3' (SEQ ID NO; 153); 5' -CCCTTCGAACAAGTCCTGAAATAGAGGATA- 

30 3' (SEQ ID N0:154)) containing BamHI and Hindlll sites. 
The resulting PCR product was ligated into pcDNA3.1 
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(- ) /Myc-Hisg A ( Irrvitrogen) which places the myc-His 
tag at the C- terminus of expressed proteins. 
pcDNA3/HA-CLAN (CARD) was created using a similar 
strategy. Authenticity of all vectors was confirmed by 
5 DNA sequencing. 



The CARD of CLAN was expressed as an epitope- 
tagged protein in HEK293T cells in co-transf ections 
with a variety of other epitope- tagged CARD- containing 
proteins, and lysates derived from these cells were 

10 used for co- immunoprecipi tation assays. Briefly, 

HEK293T cells were seeded onto six-well plates (35mm 
wells) and transfected with 0.2-2 mg plasmid DNA using 
Superfect (Qiagen) 24 hr later. After culturing for a 
day, cells were collected and lysed in isotonic lysis 

15 buffer (142.4 mM KCl , 5 mM MgCl 2 , 10 mM HEPES (pH 7.4), 
0.5 mM EGTA, 0.2% NP-4 0, 12 . 5 mM b-glycerophosphate , 2 
mM NaF , 1 mM Na 3 V0 4 , 1 mM PMSF, and IX protease 
inhibitor mix (Roche) ) . Lysates were clarified by 
centrif ugation and subjected to immunoprecipitation 

20 using agarose-conjugated anti-c-myc antibodies (Santa 
Cruz) , or non-specific control antibodies and Protein 
G-agarose for 2-24hr at 4°C. Immune -complexes were 
washed four times with lysis buffer, boiled in Laemmli 
buffer, and separated by 12-15% PAGE. Immune -complexes 

2 5 were then transferred to PVDF membranes and 

immunoblotted with anti-c-myc (Santa Cruz) , anti-HA 
(Roche), or anti-flag (Sigma) antibodies. Membranes 
were washed, incubated with HRP-conjugated secondary 
antibodies, and reactive proteins were detected using 

30 ECL. 



Co- immunoprecipitation analysis indicated 
that the CARD of CLAN bound readily to full-length pro- 
caspase-1 but did not significantly bind another CARD- 



WO 01/90156 PCT/US01/17158 

119 

containing caspase, caspase- 9. Among the other CED-4 
family members which contain a CARD in conjunction with 
a nucleotide-binding domain, CLAN interacted with the 
CARDs of Nod2 and NAC, but not with Apaf-1 or Nod-1. 
5 Finally, the CLAN CARD was found to associate with Bel- 
10, but not with another adapter protein, RAIDD . 

11-0 Cloning and characterization of CARD3X Based 

on an analysis of the overlapping genomic contigs GI 
8575872 and GI 5001450, a cDNA sequence for CARD3X was 
10 predicted (SEQ ID NO: 82), that encoded amino acid 
sequences designated SEQ ID NOS:83 and 107. 

For identification of novel domains in 
CARD3X, the sequence of the CARD domain of polypeptide 
CARD3X was used as a query for a tblastn search in the 
15 HTGS database, and two overlapping genomic contigs were 
found (GI numbers 5001450 and 8575872) . This contig 
was analyzed using the GenScan server 

(http;// ccr-081 .mit . edu/ GENS CAN. html ) for the presence 
of exons. (Burge and Karl in, J. Mol . Biol. 268:78-94 
2 0 (1997) ) . The predicted protein sequences coded by the 
exons were analyzed by comparison with the NCBI nr 
protein sequence database using PSI -BLAST. The 
predicted protein sequences coded by the exons were 
analyzed also by comparison with a database of proteins 

2 5 with known three-dimensional structures and apoptosis 

related domains using the profile-profile comparison 
server at 

http : //bioinf ormatics .bumham-inst . org/FFAS_apoptosis 
(Rychlewski, et al . , Protein Science 9:232-241 (2000)). 

3 0 CARD3X contains two CARD domains, a CARD- A 

and CARD-B domain (see Figure 3) . An NB-ARC domain was 
also observed (see Figure 3) . The NB-ARC is similar to 
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both the CLAN and APAF-1 NB-ARC domains and to NB-ARC 
domains from several plant disease resistance proteins 
(Aravind et al . , Trends Biochem. Sci . 24:47-53 (1999); 
Young, Curr . Qpin. Plant Biol, 4:2 85-289 (2000)). 

5 An angio~R domain was also identified at 

amino acids 457-839 of SEQ ID NO:107. An "angio-R" is 
a new domain that can be defined as a region of a 
polypeptide chain that bears substantial similarity 
(e.g. 25, 30, 40% sequence identity) to the 514-reside 
10 long protein "angiotensin II/vasopressin receptor" 

(described in Ruiz-Opazo et al . , Nature Med. 1:1074- 
1081 (1995)). The "angio-R" domain has not been 
previously described in any protein. 

To confirm the predicted sequences, cDNAs 
15 were cloned and sequenced. The CARD3X cDNA was cloned 
using a Rapid-Screen™ Arrayed Placenta cDNA Library 
Panel from Origene Technologies, Inc. The library 
cDNAs had been pre- selected for long clones, 
unidirectionally cloned into the vector pCMV6-XL4, and 
20 arrayed in a 96~well format. An initial Master Plate 
containing 500,000 cDNA clones was screened by PCR, 
using the forward primer 5 1 -GAAATGTGCTCGCAGGAGG- 3' 
(SEQ ID NO: 185) and the reverse primer 

5 1 -GATGAGCTTCTGACAGGCCC- 3' (SEQ ID NO: 186). A set of 
25 5000 clones that were initially positive by PCR were 
screened again with the same set of primers. Positive 
clones were plated on LB /Amp plates, and a further 
round of single colony PCRs was performed in order to 
obtain the desired clone. 

3 0 Three independent clones were sequenced, each 

of which corresponded to the nucleotide sequence SEQ ID 
NO:187. The cDNA sequence differed at both the N- and 
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C-terminal ends from the CARD 3 X sequence predicted from 
analysis of genomic exons . SEQ ID NO: 187 encodes a 
polypeptide of 795 amino acids (SEQ ID NO:188), 
followed by a stop codon. A second open reading frame 
5 begins after the stop codon, and in the same reading 
frame, and encodes a polypeptide of 180 amino acids 
(SEQ ID N0:189). SEQ ID NO:189 contains several 
leucine rich repeats. 

Subsequent to the identification of the two 
10 polypeptides encoded by SEQ ID NO: 187, a publication 
reported the cloning of a gene designated Nod2 cloned 
(Ogura et al . , J. Biol . Chem . 276:4812-4818 (2001)). 
The published Nod2 sequence has additional N-terminal 
amino acids relative to SEQ ID NO: 188 and, instead of 
15 the stop codon between the residues that encode SEQ ID 
NO: 188 and SEQ ID NO: 189, additional coding sequence is 
present, which encodes several additional leucine rich 
repeats. The published Nod2 sequence is 1040 amino 
acids . 



2 0 It is proposed that SEQ ID NO: 188 is a splice 

variant form of CARD3X/Nod2 that does not contain an 
LRR domain. The LRR of Nod2 has been shown to 
interfere with the ability of the protein to activate 
NFkB (Ogura et al . , supra (2001)). Therefore, SEQ ID 

2 5 NO: 188 is likely expressed under physiological 

conditions in which activation of NFkB is required. 



Human CARD3X cDNA sequences were used as a 
query for BLAST searches of several mouse databases. A 
3 0 genomic sequence, SEQ ID NO: 190, was identified. 

Nucleotides 191-614 of SEQ ID NO: 190 are homologous to 
the ANGIO-R 'coding region of human CARD3X, Nucleotides 
193-612 of SEQ ID NO: 191 -were predicted to encode SEQ 
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ID NO: 191, which is highly homologous to amino acids 
214-341 of the ANGIO-R domain of human CARD 3 X (SEQ ID 
NO:176) . 

PCR was then performed on mouse genomic DNA 
5 obtained from C57B6 and NIH3T3 cell lines, using the 
following primers: Forward primer: 

5 ■ -CTGCAGAAGGCTGAGCCACACAACCT-3 • (SEQ ID NO:194), 
Reverse primer : 5 1 -ACAGAGTTGTAATCCAGCTGTAGGGCCACA-3 ' 
(SEQ ID NO: 195) . The PCR product so obtained was 

10 sequenced (SEQ ID NO-.192), and shown to have several 

nucleotide differences as compared to the corresponding 
region of SEQ ID NO: 190. The predicted amino acid 
sequence encoded by SEQ ID NO: 192 (designated SEQ ID 
NO: 193) had a single amino acid difference in 

15 comparison with SEQ ID NO: 191. 

Both the CARD -A and CARD-B domains are 
independently cloned into pcDNA3 with epitope tags such 
as myc or HA, as described above, and binding of the 
CARD domains is tested with co-immunoprecipitation to 
20 test binding of CARD3X CARD domains with other known 
CARD domains, as described above. 

The NB-ARC domain is cloned into a yeast two- 
hybrid vector and into pcDNA3 with two alternative 
epitope tags (e.g., myc and Flag) to determine whether 

25 the NB-ARC domain self -associates in an ATP-dependent 
manner/ P- loop mutation. The P-loop, which binds the 
gamma phosphate of ATP in the NB-ARC domain, is mutated 
to remove a conserved Lys in the consensus P-loop 
sequence G-S/T-K, where Lys is generally mutated to 

30 Met. The NB-ARC domain is also tested for binding to 
the NB-domains of other CED-4 like proteins (e.g., 
apafl, nodi, nac) . 
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12.0 Characterization of COP-1. Using the 

amino-acid sequence of the caspase-1 prodomain as a 
query for BLASTn searches of the public databases, a 
5 human EST clone (GenBank accession number AA070591) 
was identified containing an ORF encoding a 97 
amino-acid protein (SEQ ID NO: 86) predicted to share 
92% sequence identity with the CARD of pro-caspase-1 
(SEQ ID NO:87). The predicted protein contains a CARD 
10 (residues 1-91) , which is followed by 6 amino-acids 

and then a stop-codon. The CARD region of COP-1 showed 
97% identity to the CARD of pro-caspase-1. 

To confirm the predicted sequences, cDNAs 
were amplified from various adult human tissues and 
15 sequenced. The sequenced COP-1 cDNA (SEQ ID NO: 85) had 
the same nucleotide sequence as the original EST. 

The start codon initiating the ORF in the 
COP-1 cDNA clones resides in a favorable context for 
translation, and is preceded by an in- frame stop codon. 

20 The 3 1 - untranslated region contains TAAA and TATA 

motifs, typical of short-lived mRNAs which are subject 
to post -transcriptional regulation, and a candidate 
polyadenylation signal sequence (AATAAA) . Thus, this 
protein contains essentially only a CARD, prompting the 

25 moniker CARD Only Protein (COP-1) . 



To determine the genomic organization of the 
COP-1 gene, the COP-1 cDNA nucleotide sequence was 
employed for searches of the High Throughput Genomic 
Sequence (HTGS) database, resulting in identification 
30 of three genomic clones containing the COP-1 gene 
(GenBank accessions numbers AC027011, AP001153 and 
AP002787) . Comparison of the COP-1 cDNA and genomic 
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DNA sequences suggests a three exon structure, in which 
only the first two amino-acids are encoded in exon 1 
and only the last 5 residues are encoded in exon 3, 
such that most of the coding regions (including the 
5 entire CARD) are derived from exon 2. The introns 
separating exons 1, 2, and 3 are 631 and 844 bp in 
length, respectively, containing consensus dinucleotide 
splice donor (GT) and splice acceptor (AG) motifs. 

The COP-1 genomic clones identified in the 
10 HTSG database have been mapped to human chromosome 

llq22, which is the same chromosomal region where the 
pro-caspase-1 gene resides, as well as pro-caspase-4, 
pro-caspase-5, and ICEBERG. To address the genomic 
localization of COP, pro-caspase-4 , pro-caspase-5, and 
15 ICEBERG genes in chromosome 11, the public database of 
Human Genome Project Working Draft 

(www.genome.cse.ucsc.edu) was searched, and the order 
of these genes from centromere to telomere was 
determined to be pro-caspase-4, pro-caspase-5, 
20 pro-caspase-1, COP, and ICEBERG. This result suggests 
that COP-1 is a separate gene, presumably arising from 
duplication of other homologous genes in this locus. 

14.1 COP-1 expression. To study the expression of 

COP-1, Northern blot analysis was performed using RNA 

25 derived from several adult human tissues and a 
32 P-labeled COP-1 cDNA probe. Blots containing 
polyA-selected mRNA from various adult tissues 
(Clontech, Palo Alto, CA) were hybridized using a 
32 P-labeled COP-1 cDNA probe. The probe represented a 

30 570 bp length cDNA containing portions of the 

5 ' -untranslated region, the complete ORF, and portions 
of the 3 1 -untranslated region of COP. The COP-1 probe 
(from the EST clone corresponding to AA070591 obtained 
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from the I.M.A.G.E. Consortium (Washington University 
School of Medicine, St. Louis, MO) ) was excised from 
the plasmid by restriction digestion with EcoRI and 
Xhol, gel-purified, and radiolabeled by the random 
5 priming method using [a- 32 P] dCTP and a kit from Ambion 
(Austin, TX) . After hybridization, heat -denatured 
probe was annealed for 1 hr at 68°C with QuickHyb 
Hybridization Solution (Stratagene, La Jolla, CA) and 
then blots were washed with solutions containing 2x 
10 SSC, 0.1% (w/v) SDS (twice each for 15 min at 25°C) 

followed by 0 . lx SSC, 0.1% (w/v) SDS (twice for 10 min 
at 40°C) . Bands were visualized by autoradiography. 



Hybridizing bands of approximately 0.6 kbp, 
1.5 kbp and 2.6 kbp were identified, with the 0.6 kbp 

15 band representing the most abundant of these 

transcripts and presumably corresponding to the 
fully-spliced COP-1 mRNA. The less abundant larger 1.5 
kbp and 2.6 kbp transcripts could represent unspliced 
precursors. Alternatively, the 2.6 kbp mRNA could 

2 0 represent pro-caspase-1 mRNA, resulting from probe 

cross- hybridization. The 0.6 kbp COP-1 mRNA was most 
abundant in spleen, followed by liver, placenta, and 
peripheral blood leukocytes (PBL) . However, most 
tissues (including heart, muscle, colon, kidney, 

25 intestine and lung) were shown to contain at least some 
detectable 0.6 kbp COP-1 mRNA. 



To corroborate the Northern blot analysis, 
COP-1 mRNA expression in adult human tissues was also 
examined using RT-PCR and COP-specific primers. cDNA 
30 samples derived from multiple human adult tissues 

(Clontech, Palo Alto, CA) were amplified using a set of 

COP-specific primers (a forward primer 

5 1 -GAAGACAGTTACCTGGCAGA-3 1 (SEQ ID NO: 147) and a 
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reverse primer 5 ' -TTGTATTCTGAACATGGCACC-3 ' (SEQ ID 
NO-.148)) . The resulting PCR products were 
size- fractionated by electrophoresis in 1.5% agarose 
gels, then stained with ethidium bromide for UV- 
5 photography. In some cases, bands were excised from 
gels, purified, and sequenced, thus verifying 
amplification of the correct product by the RT-PCR 
assay. 



RT-PCR analysis showed that COP-1 mRNA was 
10 expressed in all tissues analyzed (brain, heart, 
muscle, colon, spleen, kidney, liver, intestine, 
placenta, lung and PBL) , except thymus. Parallel 
RT-PCR analysis of P-actin mRNA served as a control. 
In general, the relative levels of COP-1 mRNA detected 
15 by RT-PCR were in agreement with the Northern blot 
data . 

14.2 COP-1 interactions. The prodomain of 

pro-caspase- 1 is required for dimerization and 
activation of this zymogen. Since the prodomain of 

20 COP-1 shares a high-degree of amino-acid sequence 
identity with the prodomain of caspase-1, the 
possibility that COP-1 interacts with pro- caspase-1 in 
co-immunoprecipitation assays was tested. Interactions 
with several other CARD- containing proteins were also 

25 tested, including COP-1 itself, RIP2, Bcl-10, cIAPl, 
cIAP2 and pro-caspase-9 . 



For these experiments, the entire open 
reading frame (ORF) of COP-1 was amplified by PCR using 
the primers (5 1 -CCAGAATTCATGGCCGACAAGGTCCTGAAG-3 1 (SEQ 
3 0 ID NO: 14 5) (forward) and 

5 ' - CCACTCGAGCTAATTTCCAGGTATCGGACC - 3 ' (SEQIDN0:146) 
(reverse) . The COP-1 PCR product was digested with 
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EcoRI/XhoI and ligated into mammalian expression 
vectors pcDNA3-Myc, pcDNA3-HA and pcDNA3 - Flag at the 
EcoRI/XhoI cloning sites. Plasmids encoding wild-type 
pro-caspase-1, RIP2, and pro-IL-ip were as described in 
5 Thome et al . , Curr, Biol . 8:885-888 (1998); 

Nett-Fiordalisi et al . , J. Leukoc . Biol . 58:717-724 

(1995) ; and Wang et al . , J. Biol. Chem . 271:20580-20587 

(1996) . 



A pro-caspase-1 Cys 285 Ala mutant was made 
10 from wild-type caspase-1 plasmid by site- directed 
mutagenesis, using a commercially available kit 
(Stratagene, La Jolla, CA) and the primers 
5 ! -GATCATCATCCAGGCCGCCCGTGGTGACAGCCCTGG-3 1 (SEQ ID 
NO : 14 9) and 5 r - CCAGGGCTGTCACCACGGGCGGCCTGGATGATGATC - 3 1 
15 (SEQ ID NO:150) . A truncation mutant of pro-caspase-1 
in which a stop codon was introduced downstream of the 
CARD was created by PGR using primers 

5 1 -CGGAATTCATGGCCGACAAGGTCCTG-3 1 (SEQ ID NO: 151) and 
CGCTCGAGTTAGTCTTGCATATTAAGGTAATTTCCAGA- 3 1 (SEQ ID 
20 NO:152) . 



Human embryonic kidney 293T cells were 
cultured at 37°C in 5% C0 2 in Dulbecco's Modified 
Eagle's Medium (DMEM) with 10% heat- inactivated fetal 
bovine serum (FBS) . Cells in log phase were 

25 transfected in 60 mm diameter dishes with expression 
plasmids (5 \ig total DNA) using Superfect Transfection 
Reagent (Qiagen, Valencia, CA) according to the 
manufacturer's recommendations. Cells were harvested 2 
days later and lysed in ice-cold NP- 40 lysis buffer 

30 (10 mM HEPES [pH 7.4], 142.5 mM KC1, 0.2% NP-40, 5 mM 
EGTA) , supplemented with 1 mM DTT, 12.5 mM 
(5 -glycerophosphate, 1 uM Na 3 V0 4/ ImM PMSF, and IX 
protease inhibitor mix (Roche, Indianapolis, IN) . Cell 
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lysates (0.5 ml) were clarified by centrif ugat ion at 
16,0 0 0xg for 5 minutes, and subjected to 
immunoprecipitation using specific antibodies, 
including anti-Myc antibodies (Santa Cruz 
5 Biotechnology, Santa Cruz, CA) , and anti-Flag 

antibodies (Sigma, St. Louis, MO), in combination with 
15 yl Protein A- or G-Sepharose (Zymed, South San 
Francisco, CA) . 



Immune-complexes were fractionated by sodium 
10 dodecyl sulf ate-polyacrylamide gel electrophoresis 
(SDS-PAGE) , and transferred to nitrocellulose 
membranes. The resulting blots were incubated with 
various antibodies, including anti-HA antibodies 
(1:1000 v/v; Roche, Indianapolis, IN), anti-Myc 
15 antibodies (1:100 v/v; Santa Cruz Biotechnology, Santa 
Cruz, CA) and anti-Flag antibodies (1: 1000 v/v; Sigma, 
St. Louis, MO), followed by horseradish 
peroxidase -conjugated secondary antibodies, and 
detection by an enhanced chemi luminescence (ECL) method 
20 (Amersham- Pharmacia, Piscataway, NJ) . Alternatively, 
lysates were analyzed directly by immunoblotting after 
normalization for total protein content. 



The co- immunoprecipitation results showed 
that HA-COP-1 co-immunoprecipitated with Myc-COP, 

25 indicating that this protein can self -associate . In 
addition, HA-COP-1 co-immunoprecipitated with 
Myc-tagged pro-caspase-1 (C285A mutant) as well as with 
a fragment of pro-caspase-1 containing only its 
CARD-carrying prodomain. Thus, COP-1 binds 

3 0 pro-caspase-1 through its CARD domain. For these 
co- immunoprecipitation experiments, the active site 
cysteine of pro-caspase-1 was mutated to avoid 
induction of apoptosis, which can occur when 
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over- expressing this protease. Additionally, Myc-COP-1 
co-immunoprecipitated with Flag-RIP2. In contrast, 
COP-1 did not co-immunoprecipitate with the 
CARD- containing proteins Bcl-10, cIAPl, cIAP2, or 
5 pro-caspase-9, thus demonstrating the specificity of 
these results. 

RIP2 has been shown to bind and activate 
caspase-1 through the interaction of their CARDS , 
resulting in oligomerization of pro-caspase- 1 and its 
10 activation via the "induced proximity 1 ' mechanism. The 
data demonstrating that COP-1 binds to both 
pro-caspase-1 and RIP2 therefore suggested that COP-1 
might function as a modulator of RIP2- induced 
pro-caspase-1 oligomerization. 



15 To test this hypothesis, experiments were 

performed in which 293T cells were transiently 
transfected with expression plasmids encoding 
Myc- tagged pro-caspase- 1 (C285A mutant) and HA-tagged 
pro-caspase-1 (C285A mutant) , with or without 

20 Flag-tagged RIP2 and COP, after which Myc-pro-caspase-1 
and HA-pro- caspase-1 association was monitored by 
co-immunoprecipitation assays. 



As determined by this co-immunoprecipitation 
assay, pro-caspase-1 self -associated and this was 

25 enhanced by co-expression of RIP2 . However, when COP-1 
was also co-expressed, this RIP2 -mediated effect on 
pro-caspase-1 self -association was negated. These 
findings suggested the possibility of a competitive 
mechanism, in which COP-1 competes with RIP2 for 

30 binding to pro-caspase-1. To test this hypothesis, 

therefore, transfection experiments were preformed in 
which Flag-RIP2 and Myc-tagged pro-caspase-1 (C285A 
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mutant) were expressed in 293T cells in the presence of 
increasing amounts of HA-tagged COP-1. The effects of 
COP-1 on association of RIP2 with pro~caspase- 1 were 
then evaulated by co- immunoprecipitat ion assays in 
5 which immunoprecipitations were performed using 

anti-Flag antibody to recover Flag-RIP2 protein and the 
resulting immune-complexes were analyzed by 
SDS- PAGE/ immunoblot ting using anti-Myc antibody to 
detect associated Myc-pro-caspase- 1. 



10 The results from these experiments indicated 

that COP-1 inhibited association of pro-caspase-1 with 
RIP2 in a dose -dependent manner. Immunoblot analysis 
of lysates from these same cells demonstrated that COP- 
1 did not affect the total levels of pro-caspase-1 or 

15 RIP2, but rather just their association. These results 
therefore confirm that COP-1 can interfere with binding 
of pro-caspase-1 to RIP2 . 



14.3 COP-1 inhibition of caspase-1 -mediated 

activation of pro-IL-lfi . Active caspase-1 cleaves 
20 pro-IL-l(3, resulting in the generation of bioactive 
IL-iP which is secreted from cells. It was 
hypothesized that COP-1 could suppress 

caspase-1- induced pro-IL-l(3 processing and thus reduce 
secretion of IL-ip. 



25 To test this hypothesis, COS-7, 293T, or 

293HEK cells were co-transf ected in 12 well (22 mm in 
diameter) plates using Lipof ectamine Plus Reagent 
(GIBCO BRL, Grand Island, NY) with plasmids encoding 
mouse pro- IL- 1(3, human caspase-1, RIP2, or COP-1, in 

30 various amounts (total DNA = 2.0 yg) . At 1 day after 
transf ection, supernatants were collected and stored at 
-80°C or used immediately to quantify secretion of 



WO 01/90156 PCT/US01/17158 

131 

mature murine IL-lp into the culture medium by an ELISA 
assay, according to the manufacturer's protocol (R&D 
systems, Minneapolis, MM) . 

Co-expression of pro-caspase-1 and pro-IL-lp 
5 in COS-7 cells resulted in secretion of mature IL-1(B 
ranging from 80 pg/ml to 250 pg/ml, which was 
proportional to the amount of pro-caspase-1 plasmid 
used (Figure 17) . This IL-1£ secretion was enhanced by 
co-expression of RIP2 plasmid. In contrast, expression 

10 of COP-1 together with pro-caspase-1, pro-IL-13, and 
RIP2 resulted in a dose -dependent decrease in the 
amount of mature IL-lp secretion, proportional to the 
amount of COP-1 -encoding plasmid used (Figure 6)~ . 
Similar results were obtained using 293T or 293HEK 

15 cells. These results indicate that COP-1 is capable of 
suppressing the caspase-1 -mediated secretion of IL-1(3. 



15.0 Identification of COP-2. A human CARD -containing 
proteins, designated COP-2, for CARD -only protein 2, 
was identified and the gene and cDNA cloned. The 

20 predicted protein of COP-2 has high sequence similarity 
to the CARD-domain of human caspase-1. For COP-2, two 
primers based on the caspase-15 genomic sequence were 
designed, one in the middle of the CARD domain (5'- 
aagaagagacggctgcttatcaat-3 1 ; SEQ ID NO.-104) and the 

25 other in the catalytic domain 

(5 1 -ccacagcaggcctcgaagatgatc-3 ' ; SEQ IDNO:105). RT- 
RTR was performed, and a single band was observed, 
although the band size was smaller than expected for 
caspase-15. The PCR product was sequenced, and it was 

30 found that two exons were deleted and the catalytic 
domain was directly connected to the CARD' domain. 
However, due to a frameshift, a stop codon occurs just 
after the CARD domain, resulting in truncated protein 
arid no translation of the catalytic domain. 
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To clone the N-terminal region, a primer 
(5 5 -atgatcctcctgaagaagag-3 ! ; SEQ ID NO: 106) was 
designed with the genomic sequence in the most N- 
terminal portion of the CARD domain including ATG . RT- 
5 PCR was performed, and the PCR product was sequenced 
and found to be the same as in the genomic DNA. A 
merged construct containing both the N-terminal 
fragment and the CARD domain sequence was made by PCR . 



The COP- 2 cDNA sequence identified contained 
10 321 nucleotides (SEQ ID NO:89), and the deduced amino 
acid sequence (SEQ ID NO: 90) had a high level of 
identity with caspase-1. An alignment of COP-2 (SEQ ID 
NO: 90) and caspase-1 (SEQ ID NO: 87) is shown in Figure 
5, with the consensus sequence (SEQ ID NO: 91) shown 
15 above the aligned sequences. The amino acids shaded in 
black are identical. The stipled shading represents a 
match within 3 distance units. COP-2 is encoded by the 
caspase-15 gene (Figure 3), but COP-2 is a CARD only 
protein that lacks the caspase catalytic domain. 

2 0 COP-2 cDNA encodes a polypeptide with 

downstream termination codons, which result in shorter 
proteins containing a CARD domain without associated 
catalytic protease domains. COP-2 is therefore 
expected to function as trans -dominant inhibitor that 

2 5 likely prevents caspase activation by binding to the 
CARD-domains (pro-domains) in pro-enzymes such as pro- 
caspase-1 . 



COP-2 polypeptide is expected to function as 
A regulator of caspase-1 activation by enhancing or 
30 suppressing the activation of caspase-1. COP-2 binding 
activity is tested, for example, by making epitope 
tagged fusions with COP-2 and caspase-1 and 
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co-immunoprecipitating to determine binding 
interactions with caspase-1. Antibodies specific for 
COP-2 are also made. 



The effect of COP-2 on caspase-1 proteolytic 
5 activity is also tested. Methods for measuring caspase 
activity are well known (see, for example, Thornberry, 
Nature 356:768-774 (1992); Thornberry and Molineaux, 
Protein Science 4:3-12 (1995); Rano et al . , Chem. Biol. 
4:149-155 (1997); Fletcher et al . , J. Interferon 
10 Cytokine Res. 15:243-248 (1995)), and are also 
described above . 



All journal article, reference and patent 
citations provided above, in parentheses or otherwise, 
whether previously stated or not, are incorporated 
15 herein by reference in their entirety. 



Although the invention has been described 
with reference to the examples provided above, it 
should be understood that various modifications can be 
made without departing from the spirit of the 
2 0 invention. 
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1. An isolated nucleic acid molecule 
encoding a CARD-containing polypeptide, or a CARD, NB- 
ARC, ANGIO-R, LRR or SAM domain therefrom, selected 
5 from: 

(a) DNA encoding a polypeptide 
comprising the amino acid sequence set forth 
in SEQ ID NOS : 12, 168, 188, 170, 172, 174, 
176, 97, 99, 101, 103, 178, 180, 182, 184, 86 

10 and 9 0; and 

(b) DNA that hybridizes to the DNA of 
(a) under moderately stringent conditions, 
wherein said DNA encodes a biologically 
active polypeptide. 

15 2. The nucleic acid molecule of claim 1, 

wherein the nucleotide sequence of said nucleic acid 
molecule comprises any of SEQ ID NOS: 11, 167, 187, 169, 
171, 173, 175, 96, 98, 100, 102, 177, 179, 181, 183, 85 
and 8 9 . 

20 3. The nucleic acid molecule of claim 1, 

wherein said nucleic acid molecule is cDNA. 

4 . A vector containing the nucleic acid 
molecule of claim 1. 

5 . Recombinant cells containing the nucleic 
25 acid molecule of claim 1. 

6. An isolated oligonucleotide comprising 
at least 15 contiguous nucleotides of the nucleic acid 
molecule of claim 2 . 
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7. An oligonucleotide according to claim 6, 
wherein said oligonucleotide is labeled with a 
detectable marker. 

8. A kit for detecting the presence of 

5 CARD-encoding nucleic acid molecule comprising at least 
one oligonucleotide according to claim 6. 



9. An isolated CARD- containing polypeptide, 
or a CARD, NB-ARC, ANGIO-R, LRR or SAM domain 
therefrom, comprising an amino acid sequence at least 

10 70% identical to the amino acid sequence set forth in 

any of SEQ ID NOS:12, 168, 188, 170, 172, 174, 176, 97, 
99, 101, 103, 178, 180, 182, 184, 86 and 90. 

10. The CARD-containing polypeptide of claim 
15 9, wherein said polypeptide is encoded by a nucleotide 

sequence set forth as any of SEQ ID N0S:11, 167, 187, 
169, 171, 173, 175, 96, 98, 100, 102, 177, 179, 181, 
183, 85 and 89. 



11. A peptide, comprising at least 10 

20 contiguous amino acids of the polypeptide of claim 9. 

12. A method of producing a CARD-containing 
polypeptide comprising expressing the cDNA of claim 3 
in vitro or in a cell under conditions suitable for 
expression of said polypeptide, wherein said cells are 

25 selected from the group consisting of bacteria cells, 
yeast cells, plant cells, animal cells, mammalian cells 
and insect cells. 



30 



13. An isolated anti-CARD antibody having 
specific reactivity with the CARD-containing 
polypeptide of claim 9. 
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14. The antibody of claim 13, wherein said 
antibody is a monoclonal antibody. 



15. A cell line producing the monoclonal 
antibody of claim 14. 



16. The antibody of claim 13, wherein said 
antibody is a polyclonal antibody. 

17. A method for identifying a nucleic acid 
molecule encoding a CARD- containing polypeptide, said 
method comprising: 

contacting a sample containing nucleic acids 
with an oligonucleotide according to claim 6, wherein 
said contacting is effected under high stringency 
hybridization conditions, and identifying a nucleic 
acid molecule which hybridizes thereto. 



15 18. A method for detecting the presence of a 

CARD- containing polypeptide in a sample, said method 
comprising contacting a test sample with an antibody 
according to claim 13, detecting the presence of an 
antibody : CARD complex, and thereby detecting the 

20 presence of a human CARD-containing polypeptide in said 
test sample. 



19. A method of identifying a CARD- 
associated polypeptide (CAP) comprising the steps of: 

(a) contacting the CARD-containing 
25 polypeptide of claim 9 with a candidate CAP; 

(b) detecting association of said CARD- 
containing polypeptide with said CAP. 
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20. A method of identifying an effective 
agent that alters the association of a CARD- containing 
polypeptide with a CARD-associated polypeptide (CAP) , 
comprising the steps of: 

5 (a) contacting the CARD- containing 

polypeptide of claim 9 and said CAP under 
conditions that allow said CARD- containing 
polypeptide and said CAP to associate with an 
agent suspected of being able to alter the 

10 association of said CARD -containing. 

polypeptide and said CAP; and 

(b) detecting the altered association 
of said CARD-containing polypeptide and said 
CAP , wherein said altered association 

15 identifies an effective agent. 

21. A method of identifying an effective 
agent that alters the association of a CARD-containing 
polypeptide with a CARD-associated polypeptide (CAP) , 
comprising the steps of : 

20 (a) contacting the CARD-containing 

polypeptide of claim 9 and said CAP under 
conditions that allow said CARD-containing 
polypeptide and said CAP to associate with an 
agent suspected of being able to alter the 

25 association of said CARD-containing 

polypeptide and said CAP; and 

(b) detecting the altered association 
of said CARD-containing polypeptide and said 
CAP, wherein said altered association 

30 identifies an effective agent, wherein said 

CAP is a CARD-containing polypeptide 
according to claim 9. 
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22 . A method of altering the level of a 
biochemical process modulated by a CARD- containing 
polypeptide, comprising the steps of: 

(a) introducing the nucleic acid 
5 molecule of claim 1 into a cell; and 

(b) expressing said nucleic acid 
molecule in said cell, whereby the expression 
of said nucleic acid alters the level of a 
biochemical process modulated by a CARD- 

10 containing polypeptide. 

23. The method of claim 22, wherein said 
biochemical process modulated by a CARD- containing 
polypeptide is selected from the group consisting of 
apoptosis, NF-kB induction, cytokine processing, cJun 
15 N-terminal kinase induction, caspase-mediated 

proteolysis, transcription, inflammation and cell 
adhesion . 



24. A method of altering the level of a 
biochemical process modulated by a CARD- containing 

2 0 polypeptide, comprising introducing an antisense 

nucleotide sequence into a cell, wherein said antisense 
nucleotide sequence specifically hybridizes to a 
nucleic acid molecule encoding the CARD- containing 
polypeptide of claim 11, whereby hybridization reduces 

25 or inhibits the expression of said CARD- containing 
polypeptide in said cell. 



25. A method of altering the level of a 
biochemical process modulated by a CARD- containing 
polypeptide, comprising contacting a sample with an 
3 0 agent that effectively alters the association of the 
CARD -containing polypeptide of claim 9 with a 
CARD-associated polypeptide, whereby the level of a 
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biochemical process modulated by a CARD- containing 
polypeptide is altered. 

26. A method of diagnosing or predicting 
clinical prognosis of a pathology characterized by an 
increased or decreased level of a CARD-containing 
polypeptide in a subject, comprising the steps of: 

(a) obtaining a test sample from the 
sub j ect ; 

(b) contacting said test sample with an 
agent that can bind the CARD-containing 
polypeptide of claim 9 under suitable 
conditions, which allow specific binding of 
said agent to said CARD-containing 
polypeptide; and 

(c) comparing the amount of said 
specific binding in said test sample with the 
amount of specific binding in a reference 
sample, wherein an increased or decreased 
amount of said specific binding in said test 
sample as compared to said reference sample 
is diagnostic or predictive of clinical 
prognosis of a pathology. 

27. A composition comprising a compound 
selected from the group consisting of a CARD-containing 

25 polypeptide, a functional fragment therefrom, and an 
anti-CARD antibody; and a pharmaceutically acceptable 
carrier . 

28. A method of treating a pathology 
characterized by abnormal cell proliferation, abnormal 

30 cell death, or inflammation, said method comprising 
administering to an individual an effective amount of 
the composition of claim 27. 



10 



15 
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29. A chimeric polypeptide comprising a 
domain selected from the group consisting of SEQ ID 
NOS:168, 170, 172, 174, 176, 178, 180, 182 and 184. 

30. A method of identifying an effective 

5 agent that modulates an activity of a NB-ARC domain of 
a CARD- containing polypeptide, comprising the steps of: 

(a) contacting a polypeptide comprising 
an NB-ARC domain set forth as either of SEQ 
ID NOS:174 or 180 with an agent known or 

10 suspected of modulating an activity of an NB- 

ARC domain; and 

(b) measuring the activity of the NB- 
ARC domain, whereby an increase or decrease 
of said activity identifies said agent as an 

15 agent that modulates the activity of the NB- 

ARC domain of said CARD- containing 
polypeptide; 

wherein the activity of the NB-ARC domain of said CARD- 
containing polypeptide is selected from homo- 
2 0 oligomerization, hetero- oligomerization, nucleotide 
hydrolysis, and nucleotide binding. 
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SEQUENCE LISTING 

<110> The Burnham Institute 
-Jleed, John C. 
Pio, Frederick F. 
Godzik, Adam 
Stehlik, Christian 
Damiano, Jason S. 
Lee, Sug-Hyung 
Oliveira, Vasco A. 
Hayashi, Hideki 
Pawlowski , Krzysztof 

<120> Novel Card Domain Containing 

Polypeptides, Encoding Nucleic Acids, and Methods of Use 



<130> FP-LJ 4665 

<150> US 09/579,240 
<151> 2000-05-24 

<150> US 09/686, 347 
<151> 2000-10-10 

<150> US 60/275, 980 
<151> 2001-03-14 

<160> 195 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<400> 1 
000 

<210> 2 

<400> 2 
000 

<210> 3 

<400> 3 
000 

<210> 4 

<400> 4 
000 

<210> 5 
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<400> 5 
000 

<210> 6 

<400> 6 
000 

<210> 7 

<400> 7 
000 

<210> 8 

<400> 8 
000 

<210> 9 

<400> 9 
000 

<210> 10 

<400> 10 
000 

<210> 11 

<211> 1038 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (930) 

<400> 11 

atg get acc gag agt act ccc tea gag ate ata gaa aga gaa aga aaa 48 

Met Ala Thr Glu Ser Thr Pro Ser Glu lie He Glu Arg Glu Arg Lys 
15 10 15 

aag ttg ctt gaa ate ctt caa cat gat cct gat tct ate tta gac acg 96 
Lys Leu Leu Glu He Leu Gin His Asp Pro Asp Ser He Leu Asp Thr 
20 25 30 

tta act tct egg agg ctg att tct gag gaa gag tat gag act ctg gag 144 
Leu Thr Ser Arg Arg Leu He Ser Glu Glu Glu Tyr Glu Thr Leu Glu 
35 40 45 

aat gtt aca gat etc ctg aag aaa agt egg aag ctg tta att ttg gta 192 
Asn Val Thr Asp Leu Leu Lys Lys Ser Arg Lys Leu Leu He Leu Val 
50 55 60 

cag aaa aag gga gag gcg acc tgt cag cat ttt etc aag tgt tta ttt 240 
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Gin Lys Lys Gly Glu Ala Thr Cys Gin His Phe Leu Lys Cys Leu Phe 
65 70 75 80 

agt act ttt cca cag tea get gec att tgc ggc tta agg cat gaa gtt 288 

Ser Thr Phe Pro Gin Ser Ala Ala He Cys Gly Leu Arg His Glu Val 
85 90 95 

tta aaa cat gag aat aca gta cct cct caa tct atg ggg gca age agt 336 

Leu Lys His Glu Asn Thr Val Pro Pro Gin Ser Met Gly Ala Ser Ser 
100 105 110 

aat tea gaa gat get ttt tct cct gga ata aaa cag cct gaa gee cct 384 

Asn Ser Glu Asp Ala Phe Ser Pro Gly He Lys Gin Pro Glu Ala Pro 

115 120 125 

gag ate aca gtg ttc ttc agt gag aag gaa cac ttg gat ttg gaa acc 432 

Glu He Thr Val Phe Phe Ser Glu Lys Glu His Leu Asp Leu Glu Thr 
130 135 140 

tct gag ttt ttc agg gac aag aaa act agt tat agg gaa aca get ttg 480 

Ser Glu Phe Phe Arg Asp Lys Lys Thr Ser Tyr Arg Glu Thr Ala Leu 
145 150 155 160 

tct gee agg aag aat gag aag gaa tat gac aca cca gaa gtc aca tta 528 

Ser Ala Arg Lys Asn Glu Lys Glu Tyr Asp Thr Pro Glu Val Thr Leu 
165 170 175 

tea tat tea gtt gag aaa gtt gga tgt gaa gtt cca gca act att aca 5 76 

Ser Tyr Ser Val Glu Lys Val Gly Cys Glu Val Pro Ala Thr He Thr 
180 185 190 

tat ata aaa gat gga cag aga tat gag gag eta gat gat tct tta tac 624 

Tyr He Lys Asp Gly Gin Arg Tyr Glu Glu Leu Asp Asp Ser Leu Tyr 

195 200 205 

tta gga aaa gag gaa tat eta gga tct gtt gac acc cct gaa gat gca 672 

Leu Gly Lys Glu Glu Tyr Leu Gly Ser Val Asp Thr Pro Glu Asp Ala 
210 215 220 

gaa gee act gtg gaa gag gag gtt tat gat gac cca gag cac gtt gga 720 

Glu Ala Thr Val Glu Glu Glu Val Tyr Asp Asp Pro Glu His Val Gly 
225 230 235 240 

tat gat ggt gaa gag gac ttc gag aat tea gaa acc aca gag ttc tct 768 

Tyr Asp Gly Glu Glu Asp Phe Glu Asn Ser Glu Thr Thr Glu Phe Ser 
245 250 255 

ggt gaa gaa cca agt tat gag gga tea gaa acc age ctt tea ttg gag 816 

Gly Glu Glu Pro Ser Tyr Glu Gly Ser Glu Thr Ser Leu Ser Leu Glu 
260 265 270 

gag gaa cag gag aaa agt ata gaa ggc tgg tct cga act cat ggg ctt 864 

Glu Glu Gin Glu Lys Ser He Glu Gly Trp Ser Arg Thr His Gly Leu 

275 280 285 
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aag cga tec tec cac gtt ggc etc cca aag tgc tgg gat tac agg cgt 912 
Lys Arg Ser Ser His Val Gly Leu Pro Lys Cys Trp Asp Tyr Arg Arg 
290 295 300 

gag cca ccc tgc ctg gec tgaaaattct gcctcaaaca tctcaaacat 960 
Glu Pro Pro Cys Leu Ala 
305 310 

ccatttatat tttgtacaag aaagtaaata aaatttttct ttttaacatt aaaaaaaaaa 1020 
aaaaaaaaaa aatctaga 103 8 

<210> 12 

<211> 310 

<212> PRT 

<213> Homo sapien 

<400> 12 

Met Ala Thr Glu Ser Thr Pro Ser Glu lie lie Glu Arg Glu Arg Lys 

15 10 15 

Lys Leu Leu Glu lie Leu Gin His Asp Pro Asp Ser lie Leu Asp Thr 

20 25 30 

Leu Thr Ser Arg Arg Leu lie Ser Glu Glu Glu Tyr Glu Thr Leu Glu 

35 40 45 

Asn Val Thr Asp Leu Leu Lys Lys Ser Arg Lys Leu Leu lie Leu Val 

50 55 60 

Gin Lys Lys Gly Glu Ala Thr Cys Gin His Phe Leu Lys Cys Leu Phe 
65 70 75 80 

Ser Thr Phe Pro Gin Ser Ala Ala lie Cys Gly Leu Arg His Glu Val 

85 90 95 

Leu Lys His Glu Asn Thr Val Pro Pro Gin Ser Met Gly Ala Ser Ser 

100 105 110 

Asn Ser Glu Asp Ala Phe Ser Pro Gly lie Lys Gin Pro Glu Ala Pro 

115 120 125 

Glu He Thr Val Phe Phe Ser Glu Lys Glu His Leu Asp Leu Glu Thr 

130 135 140 

Ser Glu Phe Phe Arg Asp Lys Lys Thr Ser Tyr Arg Glu Thr Ala Leu 
145 150 155 160 

Ser Ala Arg Lys Asn Glu Lys Glu Tyr Asp Thr Pro Glu Val Thr Leu 

165 170 175 

Ser Tyr Ser Val Glu Lys Val Gly Cys Glu Val Pro Ala Thr He Thr 

180 185 190 

Tyr He Lys Asp Gly Gin Arg Tyr Glu Glu Leu Asp Asp Ser Leu Tyr 

195 200 205 

Leu Gly Lys Glu Glu Tyr Leu Gly Ser Val Asp Thr Pro Glu Asp Ala 

210 215 220 

Glu Ala Thr Val Glu Glu Glu Val Tyr Asp Asp Pro Glu His Val Gly 
225 230 235 240 

Tyr Asp Gly Glu Glu Asp Phe Glu Asn Ser Glu Thr Thr Glu Phe Ser 

245 250 255 

Gly Glu Glu Pro Ser Tyr Glu Gly Ser Glu Thr Ser Leu Ser Leu Glu 

260 265 270 

Glu Glu Gin Glu Lys Ser He Glu Gly Trp Ser Arg Thr His Gly Leu 

275 280 285 

Lys Arg Ser Ser His Val Gly Leu Pro Lys Cys Trp Asp Tyr Arg Arg 
290 295 300 
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Glu Pro 
305 

<210> 13 

<400> 13 
000 

<210> 14 

<400> 14 
000 

<210> 15 

<400> 15 
000 

<210> 16 

<400> 16 
000 

<210> 17 

<400> 17 
000 

<210> 18 

<400> 18 
000 

<210> 19 

<400> 19 
000 

<210> 20 

<400> 20 
000 

<210> 21 

<400> 21 
000 

<210> 22 

<400> 22 
000 

<210> 23 



Pro Cys 



Leu Ala 

31C 
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<400> 23 
000 

<210> 24 

<400> 24 
000 

<210> 25 

<400> 25 
000 

<210> 26 

<400> 26 
000 

<210> 27 

<400> 27 
000 

<210> 28 

<400> 28 
000 

<210> 29 

<400> 29 
000 

<210> 30 

<400> 30 
000 

<210> 31 

<400> 31 
000 

<210> 32 

<400> 32 
000 

<210> 33 

<400> 33 
000 

<210> 34 
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<400> 34 
000 

<210> 35 

<400> 35 
000 

<210> 36 

<400> 36 
000 

<210> 37 

<400> 37 
000 

<210> 38 

<400> 38 
000 

<210> 39 

<400> 39 
000 

<210> 40 

<400> 40 
000 

<210> 41 

<400> 41 
000 

<210> 42 

<400> 42 
000 

<210> 43 

<400> 43 
000 

<210> 44 

<400> 44 
000 

<210> 45 
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<400> 45 
000 

<210> 46 

<400> 46 
000 

<210> 47 

<400> 47 
000 

<210> 48 

<400> 48 
000 

<210> 49 

<400> 49 
000 

<210> 50 

<400> 50 
000 

<210> 51 

<400> 51 
000 

<210> 52 

<400> 52 
000 

<210> 53 

<400> 53 
000 

<210> 54 

<400> 54 
000 

<210> 55 

<400> 55 
000 

<210> 56 
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<400> 56 
000 

<210> 57 

<400> 57 
000 

<210> 58 

<400> 58 
000 

<210> 59 

<400> 59 
000 

<210> 60 

<400> 60 
000 

<210> 61 

<400> 61 
000 

<210> 62 

<400> 62 
000 

<210> 63 

<400> 63 
000 

<210> 64 

<400> 64 
000 

<210> 65 

<400> 65 
000 

<210> 66 

<400> 66 
000 

<210> 67 
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<400> 67 
000 

<210> 68 

<400> 68 
000 

<210> 69 

<400> 69 
000 

<210> 70 

<400> 70 
000 

<210> 71 

<400> 71 
000 

<210> 72 

<400> 72 
000 

<210> 73 

<400> 73 
000 

<210> 74 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 74 

tacttacttt gtcccttca 19 

<210> 75 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 75 

tatttgtccc catctcgtc 19 
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<210> 76 
<21i> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 76 

cggaattcat ggctaccgag agtactcc 

<210> 77 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 77 

gtaaaacgac ggccagt 

<210> 78 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 78 

gcagaagcca ctgtggaaga ggaggtt 

<210> 79 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 79 

atacgactca ctatagggcg aattggcc 

<210> 80 

<220> 

<223> Primer 

<400> 80 
000 

<210> 81 



17 



27 



28 
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<220> 

<223> Primer 

<400> 81 
000 

<210> 82 

<211> 3030 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (1680) 

<400> 82 

tgt gaa atg tgc teg cag gag get ttt cag gca cag agg age cag ctg 48 
Cys Glu Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu 
1 5 10 15 

gtc gag ctg ctg gtc tea ggg tec ctg gaa ggc ttc gag agt gtc ctg 96 
Val Glu Leu Leu Val Ser Gly Ser Leu Glu Gly Phe Glu Ser Val Leu 
20 25 30 

gac tgg ctg ctg tec tgg gag gtc etc tec tgg gag gac tac gag ggc 144 
Asp Trp Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly 
35 40 45 

ttc cac etc ctg ggc cag cct etc tec cac ttg gee agg cgc ctt ctg 192 
Phe His Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu 
50 55 60 

gac acc gtc tgg aat aag ggt act tgg gee tgt cag aag etc ate gcg 240 
Asp Thr Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu lie Ala 
65 70 75 80 

get gee caa gaa gee cag gee gac age cag tec ccc aag ctg cat ggc 288 
Ala Ala Gin Glu Ala Gin Ala Asp Ser Gin Ser Pro Lys Leu His Gly 
85 90 95 

tgc tgg gac ccc cac teg etc cac cca gec cga gac ctg cag agt cac 336 
Cys Trp Asp Pro His Ser Leu His Pro Ala Arg Asp Leu Gin Ser His 
100 105 HO 

egg cca gee att gtc agg agg etc cac age cat gtg gag aac atg ctg 3 84 
Arg Pro Ala He Val Arg Arg Leu His Ser His Val Glu Asn Met Leu 
115 120 125 

gac ctg gca tgg gag egg ggt ttc gtc age cag tat gaa tgt gat gaa 432 
Asp Leu Ala Trp Glu Arg Gly Phe Val Ser Gin Tyr Glu Cys Asp Glu 
130 135 140 

ate agg ttg ccg ate ttc aca ccg tec cag agg gca aga agg ctg ctt 480 
He Arg Leu Pro He Phe Thr Pro Ser Gin Arg Ala Arg Arg Leu Leu 
145 150 155 160 
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gat ctt gcc acg gtg aaa gcg aat gga ttg get gec ttc ctt eta caa 528 
Asp Leu Ala Thr Val Lys Ala Asn Gly Leu Ala Ala Phe Leu Leu Gin 
165 170 175 

cat gtt cag gaa tta cca gtc cca ttg gcc ctg cct ttg gaa get gcc 576 
His Val Gin Glu Leu Pro Val Pro Leu Ala Leu Pro Leu Glu Ala Ala 
180 185 190 

aca tgc aag aag tat atg gcc aag ctg agg acc acg gtg tct get cag 624 
Thr Cys Lys Lys Tyr Met Ala Lys Leu Arg Thr Thr Val Ser Ala Gin 
195 200 205 

tct cgc ttc etc agt acc tat gat gga gca gag acg etc tgc ctg gag 672 
Ser Arg Phe Leu Ser Thr Tyr Asp Gly Ala Glu Thr Leu Cys Leu Glu 
210 215 220 

gac ata tac aca gag aat gtc ctg gag gtc tgg gca gat gtg ggc atg 720 
Asp lie Tyr Thr Glu Asn Val Leu Glu Val Trp Ala Asp Val Gly Met 
225 230 235 240 

get gga ccc ccg cag aag age cca gcc acc ctg ggc ctg gag gag etc 768 
Ala Gly Pro Pro Gin Lys Ser Pro Ala Thr Leu Gly Leu Glu Glu Leu 
245 250 255 

ttc age acc cct ggc cac etc aat gac gat gcg gac act gtg ctg gtg 816 
Phe Ser Thr Pro Gly His Leu Asn Asp Asp Ala Asp Thr Val Leu Val 
260 265 270 

ggt gag gcg ggc agt ggc aag age acg etc ctg cag egg ctg cac 86 4 
Val Gly Glu Ala Gly Ser Gly Lys Ser Thr Leu Leu Gin Arg Leu His 
275 280 285 

ttg ctg tgg get gca ggg caa gac ttc cag gaa ttt etc ttt gtc ttc 912 
Leu Leu Trp Ala Ala Gly Gin Asp Phe Gin Glu Phe Leu Phe Val Phe 
290 295 300 

cca ttc age tgc egg cag ctg cag tgc atg gcc aaa cca etc tct gtg 960 
Pro Phe Ser Cys Arg Gin Leu Gin Cys Met Ala Lys Pro Leu Ser Val 
305 310 315 320 

egg act eta etc ttt gag cac tgc tgt tgg cct gat gtt ggt caa gaa 100 8 
Arg Thr Leu Leu Phe Glu His Cys Cys Trp Pro Asp Val Gly Gin Glu 
325 330 335 

gac ate ttc cag tta etc ctt gac cac cct gac cgt gtc ctg tta acc 1056 
Asp lie Phe Gin Leu Leu Leu Asp His Pro Asp Arg Val Leu Leu Thr 
340 345 350 



ttt gat ggc ttt gac gag ttc aag ttc agg ttc acg gat cgt gaa cgc 
Phe Asp Gly Phe Asp Glu Phe Lys Phe Arg Phe Thr Asp Arg Glu Arg 
355 360 365 



1104 



cac tgc tec ccg acc gac ccc acc tct gtc cag acc ctg etc ttc aac 1152 
His Cys Ser Pro Thr Asp Pro Thr Ser Val Gin Thr Leu Leu Phe Asn 
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370 375 380 

ctt ctg cag ggc aac ctg ctg aag aat gcc cgc aag gtg gtg acc age 1200 
Leu Leu Gin Gly Asn Leu Leu Lys Asn Ala Arg Lys Val Val Thr Ser 
385 390 395 400 

cgt ccg gcc get gtg teg gcg ttc etc agg aag tac ate cgc acc gag 1248 
Arg Pro Ala Ala Val Ser Ala Phe Leu Arg Lys Tyr lie Arg Thr Glu 
405 410 415 

ttc aac etc aag ggc ttc tct gaa cag ggc ate gag ctg tac ctg agg 1296 
Phe Asn Leu Lys Gly Phe Ser Glu Gin Gly lie Glu Leu Tyr Leu Arg 
420 425 . 430 

aag cgc cat cat gag ccc ggg gtg gcg gac cgc etc ate cgc ctg etc 1344 
Lys Arg His His Glu Pro Gly Val Ala Asp Arg Leu lie Arg Leu Leu 
435 440 445 

caa gag acc tea gcc ctg cac ggt ttg tgc cac ctg cct gtc ttc tea 1392 
Gin Glu Thr Ser Ala Leu His Gly Leu Cys His Leu Pro Val Phe Ser 
450 455 460 

tgg atg gtg tec aaa tgc cac cag gaa ctg ttg ctg cag gag ggg ggg 1440 
Trp Met Val Ser Lys Cys His Gin Glu Leu Leu Leu Gin Glu Gly Gly 
465 470 475 480 

tec cca aag acc act aca gat atg tac ctg ctg att ctg cag cat ttt 1488 
Ser Pro Lys Thr Thr Thr Asp Met Tyr Leu Leu lie Leu Gin His Phe 
485 490 495 

ctg ctg cat gcc acc ccc cca gac tea get tec caa ggt ctg gga ccc 1536 
Leu Leu His Ala Thr Pro Pro Asp Ser Ala Ser Gin Gly Leu Gly Pro 
500 505 510 

agt ctt ctt egg ggc cgc etc ccc acc etc ctg cac ctg ggc aga ctg 1584 
Ser Leu Leu Arg Gly Arg Leu Pro Thr Leu Leu His Leu Gly Arg Leu 
515 520 525 

get ctg tgg ggc ctg ggc atg tgc tgc tac gtg ttc tea gcc cag cag 1632 
Ala Leu Trp Gly Leu Gly Met Cys Cys Tyr Val Phe Ser Ala Gin Gin 
530 535 540 

etc cag gca gca cag gtc age cct gat gac att tct ctt ggc ttc ctg 1680 
Leu Gin Ala Ala Gin Val Ser Pro Asp Asp lie Ser Leu Gly Phe Leu 
545 550 555 560 

gtgcgtgcca aaggtgtcgt gecagggagt acggcgcccc tggaattcct tcacatcact 1740 
ttccagtgct tctttgccgc gttctacctg gcactcagtg ctgatgtgcc accagctttg 1800 
ctcagacacc tcttcaattg tggcaggeca ggcaactcac caatggccag gctcctgccc 1860 
acgatgtgca tccaggcctc ggagggaaag gaeagcageg tggcagcttt getgeagaag 1920 
gccgagccgc acaaccttca gatcacagca gccttcctgg cagggctgtt gtccegggag 1980 
cactggggcc tgctggctga gtgecagaca tctgagaagg ccctgctccg gcgccaggcc 2040 
tgtgcccgct ggtgtctggc ccgcagcctc cgcaagcact tccactccat cccgccagct 2100 
geacegggtg aggecaagag cgtgcatgcc atgcccgggt tcatctggct catccggagc 2160 
ctgtacgaga tgcaggagga gcggctggct eggaaggctg cacgtggcct gaatgttggg 2220 
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cacctcaagt tgacattttg cagtgtgggc cccactgagt gtgctgccct ggcctttgtg 2280 
ctgcagcacc tccggcggcc cgtggccctg cagctggact acaactctgt gggtgacatt 2340 
ggcgtggagc agctgctgcc ttgccttggt gtctgcaagg ctctgtattt gcgcgataac 2400 
aatatctcag accgaggcat ctgcaagctc attgaatgtg ctcttcactg cgagcaattg 2460 
cagaagttag cgctggggaa taactacatc actgccgcgg gagcccaagt gctggccgag 2 52 0 
gggctccgag gcaacacctc cttgcagttc ctgggattct ggggcaacag agtgggtgac 2580 
gagggggccc aggccctggc tgaagccttg ggtgatcacc agagcttgag gtggctcagc 2640 
ctggtgggga acaacattgg cagtgtgggt gcccaagcct tggcactgat gctggcaaag 2700 
aacgtcatgc tagaagaact ctgcctggag gagaaccatc tccaggatga aggtgtatgt 2 760 
tctctcgcag aaggactgaa gaaaaattca agtttgaaaa tcctgaacat aaaaattcat 2820 
gcttcgggat tcaacaaact cttggaaagc attttctgca tcctcctggt tgtggaagca 2880 
tttttcctgc agaaagttgt caagattctt gaagaaatgg tagtcagttg gctagaggtc 2940 
aggttgtcca ataactgcat cacctaccta ggggcagaag ccctcctgca ggcccttgaa 3 000 
aggaatgaca ccatcctgga agtctggtaa 3 030 

<210> 83 

<211> 560 

<212> PRT 

<213> Homo sapien 

<400> 83 

Cys Glu Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu 

15 10 15 

Val Glu Leu Leu Val Ser Gly Ser Leu Glu Gly Phe Glu Ser Val Leu 

20 25 30 

Asp Trp Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly 

35 40 45 

Phe His Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu 

50 55 60 

Asp Thr Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu lie Ala 
65 70 75 80 

Ala Ala Gin Glu Ala Gin Ala Asp Ser Gin Ser Pro Lys Leu His Gly 

85 90 95 

Cys Trp Asp Pro His Ser Leu His Pro Ala Arg Asp Leu Gin Ser His 

100 105 110 

Arg Pro Ala lie Val Arg Arg Leu His Ser His Val Glu Asn Met Leu 

115 120 125 

Asp Leu Ala Trp Glu Arg Gly Phe Val Ser Gin Tyr Glu Cys Asp Glu 

130 135 140 

lie Arg Leu Pro lie Phe Thr Pro Ser Gin Arg Ala Arg Arg Leu Leu 
145 150 155 160 

Asp Leu Ala Thr Val Lys Ala Asn Gly Leu Ala Ala Phe Leu Leu Gin 

165 170 175 

His Val Gin Glu Leu Pro Val Pro Leu Ala Leu Pro Leu Glu Ala Ala 

180 185 190 

Thr Cys Lys Lys Tyr Met Ala Lys Leu Arg Thr Thr Val Ser Ala Gin 

195 200 205 

Ser Arg Phe Leu Ser Thr Tyr Asp Gly Ala Glu Thr Leu Cys Leu Glu 

210 215 220 

Asp lie Tyr Thr Glu Asn Val Leu Glu Val Trp Ala Asp Val Gly Met 
225 230 235 240 

Ala Gly Pro Pro Gin Lys Ser Pro Ala Thr Leu Gly Leu Glu Glu Leu 

245 250 255 

Phe Ser Thr Pro Gly His Leu Asn Asp Asp Ala Asp Thr Val Leu Val 
260 265 270 
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Val 


Gly 


Glu Ala 


Gly Ser 


Gly 


Lys 


Ser 


Thr 


Leu 


Leu 


Gin 


Arcr 


Leu 


His 






275 










280 










285 








Leu 


Leu 


Trp Ala 


Ala 


Gly 


Gin 


Asp 


Phe 


Gin 


Glu 


Phe 


Leu 


Phe 


Val 


Phe 




290 










295 










300 










Pro 


Phe 


Ser 


Cys 


Arg 


Gin 


Leu 


Gin 


Cvs 


Met 


Ala 


Lys 


Pro 


Leu 


Ser 


Val 


305 










310 










315 










~j & kj 


Arg 


Thr 


Leu 


Leu 


Phe 


Glu 


His 


Cys 


Cvs 


Tro 


Pro 


Asp 


Val 


Glv 


Gin 


Glu 










325 










330 










33 5 




Asp 


lie 


Phe 


Gin 


Leu 


Leu 


Leu 


Asp 


His 


Pro 


Asp 


Arg 


Val 


Leu 


Leu 


Thr 








340 










345 










J) 3 \J 






Phe 


Asp 


Gly Phe 


Asp Glu 


Phe 


Lys 


Phe 




Phe 


Thr 


Asp 


Arg 


UX LI 








355 










360 










JO J 








His 


Cvs 


Ser 


Pro 


Thr Asp 


Pro 


Thr 


Ser 


Val 

V CI -L 






T.oi i 
i-IC <U 


T ./si i 


irxie 


Asn 




370 










^7 R 










~i q n 










Leu 


Leu 


Gin Gly 


Asn 


Leu 


Leu 


Lys 




•A "I -> 


Arg 


Lys 


vai 


val 


Thr 


Ser 


385 










390 










T QC 

J ? J 










400 




Pro 


Ala 


Ala 


Val 


Ser 


7\ "1 3 
Ala 


•flic 


Leu 


Arg 


Lys 


Tyr 


lie 


Arg 


Thr 


G1U 










405 










/inn 










415 




Phe 


Asn 


Leu Lys 


Gly Phe 


O t: JL 


m n 

VjjX U. 


UlU 


, r 

oiy 


116 


LilU 


Leu 


Tyr 


Leu 


Arg 








420 










A o c 
4^ o 










430 






JLl jr o 


Arg 


His 


His 


Glu 


Pro 


r2*i , , 


v ax 


ax a 


Asp 


Arg 


Leu 


lie 


Arg 


Leu 


Leu 






435 










a a n 










445 








dl n 

OJLI1 




Thr 


Ser 


Ala 


Leu 


rlXS 


uiy 


Leu 


Cys 


HIS 


Leu 


Pro 


Val 


Pne 


Ser 




4oU 










455 










460 










Trp 




Val 


Ser 


Lys 


Cys 


rllS 


bin 


blU 


Leu 


Leu 


Leu 


Gin 


Glu 


Gly 


Gly 


4ob 










470 










475 










480 


Ser 


Pro 


Lys 


Thr 


Thr 


Thr 


Asp 


Met 


Tyr 


Leu 


Leu 


lie 


Leu 


Gin 


His 


Phe 










485 










490 










495 




Leu 


Leu 


His 


Ala 


Thr 


Pro 


Pro 


Asp 


Ser 


Ala 


Ser 


Gin 


Gly 


Leu 


Gly 


Pro 








500 










505 










510 






Ser 


Leu 


Leu 


Arg 


Gly Arg 


Leu 


Pro 


Thr 


Leu 


Leu 


His 


Leu 


Gly 


Arg 


Leu 






515 










520 










525 








Ala 


Leu 


Trp Gly 


Leu 


Gly 


Met 


Cys 


Cys 


Tyr 


Val 


Phe 


Ser 


Ala 


Gin 


Gin 




530 










535 










540 










Leu 


Gin 


Ala 


Ala 


Gin 


Val 


Ser 


Pro 


Asp 


Asp 


lie 


Ser 


Leu 


Gly 


Phe 


Leu 


545 










550 










555 










560 



<210> 84 

<211> 1107 

<212> DNA 

<213> Homo sapien 

<400> 84 

attctttttt taacttttac ttattcatta ggatgatttc ataatatatt tcctggttta 60 

gaggaaacag gaacaatggc taccgagagt actccctcag agatcataga aagagaaaga 120 

aaaaagttgc ttgaaatcct tcaacatgat cctgattcta tcttagacac gttaacttct 180 

cggaggctga tttctgagga agagtatgag actctggaga atgttacaga tctcctgaag 240 

aaaagtcgga agctgttaat tttggtacag aaaaagggag aggcgacctg tcagcatttt 300 

ctcaagtgtt tatttagtac ttttccacag tcagctgcca tttgcggctt aaggcatgaa 360 

gttttaaaac atgagaatac agtacctcct caatctatgg gggcaagcag taattcagaa 420 

gatgcttttt ctcctggaat aaaacagcct gaagcccctg agatcacagt gttcttcagt 480 

gagaaggaac acttggattt ggaaacctct gagtttttca gggacaagaa aactagttat 540 

agggaaacag ctttgtctgc caggaagaat gagaaggaat atgacacacc agaagtcaca 600 

ttatcatatt cagttgagaa agttggatgt gaagttccag caactattac atatataaaa 660 
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gatggacaga gatatgagga gctagatgat tctttatact taggaaaaga ggaatatcta 720 

ggatctgttg acacccctga agatgcagaa gccactgtgg aagaggaggt ttatgatgac 780 

ccagagcacg ttggatatga tggtgaagag gacttcgaga attcagaaac cacagagttc 840 

tctggtgaag aaccaagtta tgagggatca gaaaccagcc tttcattgga ggaggaacag 900 

gagaaaagta tagaaggctg gtctcgaact catgggctta agcgatcctc ccacgttggc 96 0 

ctcccaaagt getgggatca caggcgtgag ccaccctgcc tggcctgaaa attctgcctc 1020 

aaacatctca aacatccatt tatattttgt acaagaaagt aaataaaatt tttcttttta 1080 
acattaaaaa aaaaaaaaaa aaaaaaa 1107 

<210> 85 

<211> 510 

<212> DNA 

<213> Homo sapien 



<220> 

<221> CDS 

<222> (15) . . . (305) 



<400> 85 

aggagagaaa agcc atg gcc gac aag gtc ctg aag gag aag aga aag ctg 5 0 

Met Ala Asp Lys Val Leu Lys Glu Lys Arg Lys Leu 
15 10 



ttt ate cat tec atg ggt gaa ggt aca ata aat ggc tta ctg gat gaa 98 

Phe lie His Ser Met Gly Glu Gly Thr He Asn Gly Leu Leu Asp Glu 

15 20 25 

tta tta cag aca agg gtg ctg aac cag gaa gag atg gag aaa gta aaa 14 6 

Leu Leu Gin Thr Arg Val Leu Asn Gin Glu Glu Met Glu Lys Val Lys 

30 35 40 



cgt gaa aat get aca gtt atg gat aag acc cga get ttg att gac tec 194 
Arg Glu Asn Ala Thr Val Met Asp Lys Thr Arg Ala Leu He Asp Ser 
45 50 55 60 



gtt att ccg aaa ggg gca cag gca tgc caa att tgc ate aca tac att 242 
Val He Pro Lys Gly Ala Gin Ala Cys Gin He Cys He Thr Tyr He 
65 70 75 



tgt gaa gaa gac agt tac ctg gca gag acg ctg gga etc tea gca ggt 290 
Cys Glu Glu Asp Ser Tyr Leu Ala Glu Thr Leu Gly Leu Ser Ala Gly 
80 85 90 

ccg ata cct gga aat tagcttagct tagtacacaa gactcccaat tactattttc 345 
Pro He Pro Gly Asn 
95 



ttccttccca gctcttcagg cagtgcagga 
agaaggcaga atcaagcttt gctttctaga 
gcagaggtgc catgttcaga atacaataat 

<210> 86 

<211> 97 

<212> PRT 

<213> Homo sapien 



caacccagct atgcccacat gctcaagccc 405 
agaegctcaa aggatatgga aacaaaagtt 465 
aaagtggagt aaaga 510 
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<400> 86 

Met Ala Asp Lys Val 

1 5 
Met Gly Glu Gly Thr 

20 

Arg Val Leu Asn Gin 
35 

Thr Val Met Asp Lys 
50 

Gly Ala Gin Ala Cys 
65 

Ser Tyr Leu Ala Glu 
85 

Asn 



Leu Lys Glu Lys Arg Lys 
10 

lie Asn Gly Leu Leu Asp 
25 

Glu Glu Met Glu Lys Val 
40 

Thr Arg Ala Leu lie Asp 
55 

Gin lie Cys lie Thx Tyr 
70 75 
Thr Leu Gly Leu Ser Ala 
90 



Leu Phe lie His Ser 
15 

Glu Leu Leu Gin Thr 
30 

Lys Arg Glu Asn Ala 
45 

Ser Val lie Pro Lys 
60 

lie Cys Glu Glu Asp 
80 

Gly Pro lie Pro Gly 
95 



<210> 87 

<211> 97 

<212> PRT 

<213> Homo sapien 



<400> 87 



Met Ala 


Asp Lys Val 


Leu 


Lys Glu Lys Arg Lys Leu 


Phe 


He 


Arg 


Ser 


1 


5 




10 






15 




Met Gly 


Glu Gly Thr 


lie 


Asn Gly Leu Leu Asp Glu 


Leu 


Leu 


Gin 


Thr 




20 




25 




30 






Arg Val 


Leu Asn Lys 


Glu 


Glu Met Glu Lys Val Lys 


Arg 


Glu 


Asn 


Ala 




35 




40 


45 








Thr Val 


Met Asp Lys 


Thr 


Arg Ala Leu lie Asp Ser 


Val 


He 


Pro 


Lys 


50 






55 60 








Gly Ala 


Gin Ala Cys 


Gin 


lie Cys He Thr Tyr He 


Cys 


Glu 


Glu 


Asp 


65 




70 


75 








80 


Ser Tyr 


Leu Ala Gly 


Thr 


Leu Gly Leu Ser Ala Asp 


Gin 


Thr 


Ser 


Gly 




85 




90 






95 





Asn 



<210> 88 

<400> 88 
000 

<210> 89 

<211> 321 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (318) 

<400> 89 
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atg ate etc ctg aag aag aga egg ctg ctt ate aat tea ttg ggt gaa 4 8 
Met lie Leu Leu Lys Lys Arg Arg Leu Leu lie Asn Ser Leu Gly Glu 
15 10 15 

ggt aca ata aat ggc tta ctg gat gaa tta ttg gag aca aat gtg ctg 96 
Gly Thr lie Asn Gly Leu Leu Asp Glu Leu Leu Glu Thr Asn Val Leu 
20 25 30 

age cag gaa gac aca gag ata gta aaa tgt gaa aat gtt aca gtt ate 144 
Ser Gin Glu Asp Thr Glu lie Val Lys Cys Glu Asn Val Thr Val lie 
35 40 45 

gat aag gee cga gat ttg ctt gac tct gtt att egg aaa ggg gca ggg 192 
Asp Lys Ala Arg Asp Leu Leu Asp Ser Val lie Arg Lys Gly Ala Gly 
50 55 60 

gca tgt gaa att tgc ate aca tac att tgt gaa gaa gac agg tac ctg 240 
Ala Cys Glu lie Cys He Thr Tyr He Cys Glu Glu Asp Arg Tyr Leu 
65 70 75 80 

gca ggg acg ctg gga etc tea gca gga aat gac tac aga get gga ggc 288 
Ala Gly Thr Leu Gly Leu Ser Ala Gly Asn Asp Tyr Arg Ala Gly Gly 
85 90 95 

att tgc tea ccg ccc aga gca caa gac etc tga 321 
He Cys Ser Pro Pro Arg Ala Gin Asp Leu 
100 105 



<210> 90 

<211> 106 

<212> PRT 

<213> Homo sapien 

<400> 90 

Met He Leu Leu Lys Lys Arg Arg Leu Leu He Asn Ser Leu Gly Glu 

15 10 15 

Gly Thr He Asn Gly Leu Leu Asp Glu Leu Leu Glu Thr Asn Val Leu 

20 25 30 

Ser Gin Glu Asp Thr Glu He Val Lys Cys Glu Asn Val Thr Val He 

35 40 45 

Asp Lys Ala Arg Asp Leu Leu Asp Ser Val He Arg Lys Gly Ala Gly 

50 55 60 

Ala Cys Glu He Cys He Thr Tyr He Cys Glu Glu Asp Arg Tyr Leu 
65 70 75 80 

Ala Gly Thr Leu Gly Leu Ser Ala Gly Asn Asp Tyr Arg Ala Gly Gly 

85 90 95 

He Cys Ser Pro Pro Arg Ala Gin Asp Leu 
100 105 



<210> 91 

<211> 108 

<212> PRT 

<213> Homo sapien 
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<400> 91 

Met Ala Asp Lys Val Leu Leu Glu 

1 5 
Leu Gly Glu Gly Thr He Asn Gly 
20 

Asn Val Leu Ser Gin Glu Asp Glu 

35 40 
Val He Asp Lys Ala Arg Ala Leu 

50 55 
Ala Gly Ala Cys Glu He Cys He 
65 70 
Tyr Leu Ala Gly Thr Leu Gly Leu 
85 

Gly Gly Ala Cys Ser Thr Ser Ser 
100 



Lys Arg Lys Leu Leu He Asn Ser 

10 15 
Leu Leu Asp Glu Leu Leu Glu Thr 
25 30 
He Val Lys Arg Glu Asn Ala Thr 
45 

Leu Asp Ser Val lie Arg Lys Gly 
60 

Thr Tyr He Cys Glu Glu Asp Ser 

75 80 
Ser Ala Gly Asn Ala Val Gin Ala 

90 95 
Gly Gin Asp Leu 
105 



<210> 92 

<400> 92 
000 

<210> 93 

<400> 93 
000 

<210> 94 

<400> 94 
000 

<210> 95 

<400> 95 
000 

<210> 96 

<211> 3396 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (277) . . . (3348) 
<400> 96 

cgcccgggca ggtgtttata ctccggaggg 
aaaactggtg atctgtttgc cctgtgtgac 
acatctgctg gaagtcctct gggattcaag 
aaaagaggac aatattggga tcacctttga 
gtgttataga aaggtgggaa gctttcatcc 



tgtccccgtg cgtcatcggt ggagtggaqc 60 
cttgcccaga accctgctga ctgagagaac 12 0 
gtacagggaa tgaagagtag ttttacagaa 180 
cctttccatt tggaaataat attttctatt 240 
agaaca atg aat ttc ata aag gac 294 
Met Asn Phe He Lys Asp 
1 5 
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aat age cga gec ctt att caa aga atg gga atg act gtt ata aag caa 342 
Asn Ser Arg Ala Leu lie Gin Arg Met Gly Met Thr Val lie Lys Gin 
10 15 20 

ate aca gat gac eta ttt gta tgg aat gtt ctg aat cgc gaa gaa gta 390 
lie Thr Asp Asp Leu Phe Val Trp Asn Val Leu Asn Arg Glu Glu Val 
25 30 35 

aac ate att tgc tgc gag aag gtg gag cag gat get get aga ggg ate 438 
Asn lie He Cys Cys Glu Lys Val Glu Gin Asp Ala Ala Arg Gly He 
40 45 50 

att cac atg att ttg aaa aag ggt tea gag tec tgt aac etc ttt ctt 486 
He His Met He Leu Lys Lys Gly Ser Glu Ser Cys Asn Leu Phe Leu 
55 60 65 70 

aaa tec ctt aag gag tgg aac tat cct eta ttt cag gac ttg aat gga 534 
Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu Phe Gin Asp Leu Asn Gly 
75 80 85 

caa agt ctt ttt cat cag aca tea gaa gga gac ttg gac gat ttg get 582 
Gin Ser Leu Phe His Gin Thr Ser Glu Gly Asp Leu Asp Asp Leu Ala 
90 95 100 

cag gat tta aag gac ttg tac cat acc cca tct ttt ctg aac ttt tat 630 
Gin Asp Leu Lys Asp Leu Tyr His Thr Pro Ser Phe Leu Asn Phe Tyr 
105 110 115 

ccc ctt ggt gaa gat att gac att att ttt aac ttg aaa age acc ttc 678 
Pro Leu Gly Glu Asp He Asp He He Phe Asn Leu Lys Ser Thr Phe 
120 125 130 

aca gaa cct ate ctg tgg agg aag gac caa cac cat cac cgc gtg gag 726 
Thr Glu Pro He Leu Trp Arg Lys Asp Gin His His His Arg Val Glu 
135 140 145 150 

cag ctg acc ctg aat ggc etc ctg cag get ctt cag age ccc tgc ate 774 
Gin Leu Thr Leu Asn Gly Leu Leu Gin Ala Leu Gin Ser Pro Cys He 
155 160 165 

att gaa ggg gaa tct ggc aaa ggc aag tec act ctg ctg cag cgc att 822 
He Glu Gly Glu Ser Gly Lys Gly Lys Ser Thr Leu Leu Gin Arg He 
170 175 180 

gec atg etc tgg ggc tec gga aag tgc aag get ctg acc aag ttc aaa 870 
Ala Met Leu Trp Gly Ser Gly Lys Cys Lys Ala Leu Thr Lys Phe Lys 
185 190 195 

ttc gtc ttc ttc etc cgt etc age agg gee cag ggt gga ctt ttt gaa 918 
Phe Val Phe Phe Leu Arg Leu Ser Arg Ala Gin Gly Gly Leu Phe Glu 
200 205 210 

acc etc tgt gat caa etc ctg gat ata cct ggc aca ate agg aag cag 966 
Thr Leu Cys Asp Gin Leu Leu Asp He Pro Gly Thr He Arg Lys Gin 
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215 220 225 230 

aca ttc atg gcc atg ctg ctg aag ctg egg cag agg gtt ctt ttc ctt 1014 

Thr Phe Met Ala Met Leu Leu Lys Leu Arg Gin Arg Val Leu Phe Leu 
235 240 245 

ctt gat ggc tac aat gaa ttc aag ccc cag aac tgc cca gaa ate gaa 1062 

Leu Asp Gly Tyr Asn Glu Phe Lys Pro Gin Asn Cys Pro Glu lie Glu 
250 255 260 

gcc ctg ata aag gaa aac cac cgc ttc aag aac atg gtc ate gtc acc 1110 

Ala Leu lie Lys Glu Asn His Arg Phe Lys Asn Met Val lie Val Thr 
265 270 275 

act acc act gag tgc ctg agg cac ata egg cag ttt ggt gcc ctg act 1158 

Thr Thr Thr Glu Cys Leu Arg His He Arg Gin Phe Gly Ala Leu Thr 
280 285 290 

get gag gtg ggg gat atg aca gaa gac age gcc cag get etc ate cga 1206 

Ala Glu Val Gly Asp Met Thr Glu Asp Ser Ala Gin Ala Leu He Arg 

295 300 305 310 

gaa gtg ctg ate aag gag ctt get gaa ggc ttg ttg etc caa att cag 12 54 

Glu Val Leu He Lys Glu Leu Ala Glu Gly Leu Leu Leu Gin He Gin 
315 320 325 

aaa tec agg tgc ttg agg aat etc atg aag acc cct etc ttt gtg gtc 1302 

Lys Ser Arg Cys Leu Arg Asn Leu Met Lys Thr Pro Leu Phe Val Val 
330 335 340 

ate act tgt gca ate cag atg ggt gaa agt gag ttc cac tct cac aca 1350 

He Thr Cys Ala He Gin Met Gly Glu Ser Glu Phe His Ser His Thr 
345 350 355 

caa aca acg ctg ttc cat acc ttc tat gat ctg ttg ata cag aaa aac 1398 

Gin Thr Thr Leu Phe His Thr Phe Tyr Asp Leu Leu He Gin Lys Asn 
360 365 370 

aaa cac aaa cat aaa ggt gtg get gca agt gac ttc att egg age ctg 1446 

Lys His Lys His Lys Gly Val Ala Ala Ser Asp Phe He Arg Ser Leu 

375 380 385 390 

gac cac cgt gga gac eta get ctg gag ggt gtg ttc tec cac aag ttt 1494 

Asp His Arg Gly Asp Leu Ala Leu Glu Gly Val Phe Ser His Lys Phe 
395 400 405 

gat ttc gaa ctg cag gat gtg tec age gtg aat gag gat gtc ctg ctg 1542 
Asp Phe Glu Leu Gin Asp Val Ser Ser Val Asn Glu Asp Val Leu Leu 
410 415 420 

aca act ggg etc etc tgt aaa tat aca get caa agg ttc aag cca aag 1590 
Thr Thr Gly Leu Leu Cys Lys Tyr Thr Ala Gin Arg Phe Lys Pro Lys 
425 430 435 

tat aaa ttc ttt cac aag tea ttc cag gag tac aca gca gga cga aga 1638 
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Tyr Lys Phe Phe His Lys Ser Phe Gin Glu Tyr Thr Ala Gly Arg Arg 
440 445 450 

etc age agt tta ttg acg tct cat gag cca gag gag gtg acc aag ggg 1686 
Leu Ser Ser Leu Leu Thr Ser His Glu Pro Glu Glu Val Thr Lys Gly 
455 460 465 470 

aat ggt tac ttg cag aaa atg gtt tec att teg gac att aca tec act 1734 
Asn Gly Tyr Leu Gin Lys Met Val Ser He Ser Asp He Thr Ser Thr 
475 480 485 

tat age age ctg etc egg tac acc tgt ggg tea tct gtg gaa gee acc 1782 
Tyr Ser Ser Leu Leu Arg Tyr Thr Cys Gly Ser Ser Val Glu Ala Thr 
490 495 500 

agg get gtt atg aag cac etc gca gca gtg tat caa cac ggc tgc ctt 183 0 
Arg Ala Val Met Lys His Leu Ala Ala Val Tyr Gin His Gly Cys Leu 
505 510 515 

etc gga ctt tec ate gec aag agg cct etc tgg aga cag gaa tct ttg 1878 
Leu Gly Leu Ser He Ala Lys Arg Pro Leu Trp Arg Gin Glu Ser Leu 
520 525 530 

caa agt gtg aaa aac acc act gag caa gaa att ctg aaa gec ata aac 1926 
Gin Ser Val Lys Asn Thr Thr Glu Gin Glu He Leu Lys Ala He Asn 
535 540 545 550 

ate aat tec ttt gta gag tgt ggc ate cat tta tat caa gag agt aca 1974 
He Asn Ser Phe Val Glu Cys Gly He His Leu Tyr Gin Glu Ser Thr 
555 560 565 

tec aaa tea gee ctg age caa gaa ttt gaa get ttc ttt caa ggt aaa 2022 
Ser Lys Ser Ala Leu Ser Gin Glu Phe Glu Ala Phe Phe Gin Gly Lys 
570 575 580 

age tta tat ate aac tea ggg aac ate ccc gat tac tta ttt gac ttc 2070 
Ser Leu Tyr He Asn Ser Gly Asn lie Pro Asp Tyr Leu Phe Asp Phe 
585 590 595 

ttt gaa cat ttg ccc aat tgt gca agt gee ctg gac ttc att aaa ctg 2118 
Phe Glu His Leu Pro Asn Cys Ala Ser Ala Leu Asp Phe He Lys Leu 
600 605 610 

gac ttt tat ggg gga get atg get tea tgg gaa aag get gca gaa gac 2166 
Asp Phe Tyr Gly Gly Ala Met Ala Ser Trp Glu Lys Ala Ala Glu Asp 
615 620 625 630 

aca ggt gga ate cac atg gaa gag gec cca gaa acc tac att ccc age 2214 
Thr Gly Gly He His Met Glu Glu Ala Pro Glu Thr Tyr He Pro Ser 
635 640 645 

a< 39 9" ct 9"ta tct ttg ttc ttc aac tgg aag cag gaa ttc agg act ctg 2262 
Arg Ala Val Ser Leu Phe Phe Asn Trp Lys Gin Glu Phe Arg Thr Leu 
650 655 660 
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gag gtc aca etc egg gat ttc age aag ttg aat aag caa gat ate aga 2310 

Glu Val Thr Leu Arg Asp Phe Ser Lys Leu Asn Lys Gin Asp He Arg 
665 670 675 

tat ctg ggg aaa ata ttc age tct gee aca age etc agg ctg caa ata 2358 

Tyr Leu Gly Lys He Phe Ser Ser Ala Thr Ser Leu Arg Leu Gin He 
680 685 690 

aag aga tgt get ggt gtg get gga age etc agt ttg gtc etc age ace 2406 

Lys Arg Cys Ala Gly Val Ala Gly Ser Leu Ser Leu Val Leu Ser Thr 

695 700 705 710 

tgt aag aac att tat tct etc atg gtg gaa gee agt ccc etc acc ata 2454 

Cys Lys Asn He Tyr Ser Leu Met Val Glu Ala Ser Pro Leu Thr He 

715 720 725 

gaa gat gag agg cac ate aca tct gta aca aac ctg aaa acc ttg agt 2502 

Glu Asp Glu Arg His He Thr Ser Val Thr Asn Leu Lys Thr Leu Ser 
730 735 740 

att cat gac eta cag aat caa egg ctg ccg ggt ggt ctg act gac age 2550 

He His Asp Leu Gin Asn Gin Arg Leu Pro Gly Gly Leu Thr Asp Ser 
745 750 755 

ttg ggt aac ttg aag aac ctt aca aag etc ata atg gat aac ata aag 2 598 

Leu Gly Asn Leu Lys Asn Leu Thr Lys Leu He Met Asp Asn He Lys 
760 765 770 

atg aat gaa gaa gat get ata aaa eta get gaa ggc ctg aaa aac ctg 2646 

Met Asn Glu Glu Asp Ala He Lys Leu Ala Glu Gly Leu Lys Asn Leu 

775 780 785 790 

aag aag atg tgt tta ttt cat ttg acc cac ttg tct gac att gga gag 2694 

Lys Lys Met Cys Leu Phe His Leu Thr His Leu Ser Asp He Gly Glu 

795 800 805 

gga atg gat tac ata gtc aag tct ctg tea agt gaa ccc tgt gac ctt 2742 

Gly Met Asp Tyr He Val Lys Ser Leu Ser Ser Glu Pro Cys Asp Leu 
810 815 820 

gaa gaa att caa tta gtc tec tgc tgc ttg tct gca aat gca gtg aaa 2 790 

Glu Glu He Gin Leu Val Ser Cys Cys Leu Ser Ala Asn Ala Val Lys 
825 830 835 

ate eta get cag aat ctt cac aat ttg gtc aaa ctg age att ctt gat 2838 

He Leu Ala Gin Asn Leu His Asn Leu Val Lys Leu Ser He Leu Asp 
840 845 850 

tta tea gaa aat tac ctg gaa aaa gat gga aat gaa get ctt cat gaa 2886 

Leu Ser Glu Asn Tyr Leu Glu Lys Asp Gly Asn Glu Ala Leu His Glu 

855 860 865 870 

ctg ate gac agg atg aac gtg eta gaa cag etc acc gca ctg atg ctg 2934 

Leu He Asp Arg Met Asn Val Leu Glu Gin Leu Thr Ala Leu Met Leu 

875 880 885 



- 24 - 



WO 01/90156 



PCT/US01/17I58 



ccc tgg ggc tgt gac gtg caa ggc age ctg age age ctg ttg aaa cat 2982 
Pro Trp Gly Cys Asp Val Gin Gly Ser Leu Ser Ser Leu Leu Lys His 
890 895 900 

ttg gag gag gtc cca caa etc gec aag ctt ggg ttg aaa aac tgg aga 3030 
Leu Glu Glu Val Pro Gin Leu Val Lys Leu Gly Leu Lys Asn Trp Arg 
905 910 915 

etc aca gat aca gag att aga att tta ggt gca ttt ttt gga aag aac 3078 
Leu Thr Asp Thr Glu lie Ajrg lie Leu Gly Ala Phe Phe Gly Lys Asn 
920 925 930 

cct ctg aaa aac ttc cag cag ttg aat ttg gcg gga aat cgt gtg age 3126 
Pro Leu Lys Asn Phe Gin Gin Leu Asn Leu Ala Gly Asn Arg Val Ser 
935 940 945 950 

agt gat gga tgg ctt gee ttc atg ggt gta ttt gag aat ctt aag caa 3174 
Ser Asp Gly Trp Leu Ala Phe Met Gly Val Phe Glu Asn Leu Lys Gin 
955 960 965 

tta gtg ttt ttt gac ttt agt act aaa gaa ttt eta cct gat cca gca 3222 
Leu Val Phe Phe Asp Phe Ser Thr Lys Glu Phe Leu Pro Asp Pro Ala 
970 975 980 

tta gtc aga aaa ctt age caa gtg tta tec aag tta act ttt ctg caa 3270 
Leu Val Arg Lys Leu Ser Gin Val Leu Ser Lys Leu Thr Phe Leu Gin 
985 990 995 

gaa get agg ctt gtt ggg tgg caa ttt gat gat gat gat etc agt gtt 3318 
Glu Ala Arg Leu Val Gly Trp Gin Phe Asp Asp Asp Asp Leu Ser Val 
1000 1005 1010 

att aca ggt get ttt aaa eta gta act get taaataaagt gtactcgaag 3368 
lie Thr Gly Ala Phe Lys Leu Val Thr Ala 
1015 1020 

ccaaaaaaaa aaaaaaaaaa aaaaaaaa 33 96 

<210> 97 

<211> 1024 

<212> PRT 

<213> Homo sapien 

<400> 97 

Met Asn Phe lie Lys Asp Asn Ser Arg Ala Leu lie Gin Arg Met Gly 

15 10 15 

Met Thr Val lie Lys Gin lie Thr Asp Asp Leu Phe Val Trp Asn Val 

20 25 30 

Leu Asn Arg Glu Glu Val Asn lie lie Cys Cys Glu Lys Val Glu Gin 

35 40 45 

Asp Ala Ala Arg Gly lie He His Met He Leu Lys Lys Gly Ser Glu 

50 55 60 

Ser Cys Asn Leu Phe Leu Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu 
65 70 75 80 
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Phe Gin Asp Leu Asn Gly Gin Ser Leu Phe His Gin Thr Ser Glu Gly 

85 90 95 

Asp Leu Asp Asp Leu Ala Gin Asp Leu Lys Asp Leu Tyr His Thr Pro 

100 105 110 

Ser Phe Leu Asn Phe Tyr Pro Leu Gly Glu Asp lie Asp He He Phe 

115 120 125 

Asn Leu Lys Ser Thr Phe Thr Glu Pro He Leu Trp Arg Lys Asp Gin 

130 135 140 

His His His Arg Val Glu Gin Leu Thr Leu Asn Gly Leu Leu Gin Ala 
145 150 155 160 

Leu Gin Ser Pro Cys He He Glu Gly Glu Ser Gly Lys Gly Lys Ser 

165 170 175 

Thr Leu Leu Gin Arg He Ala Met Leu Trp Gly Ser Gly Lys Cys Lys 

180 185 190 

Ala Leu Thr Lys Phe Lys Phe Val Phe Phe Leu Arg Leu Ser Arg Ala 

195 200 205 

Gin Gly Gly Leu Phe Glu Thr Leu Cys Asp Gin Leu Leu Asp He Pro 

210 215 220 

Gly Thr He Arg Lys Gin Thr Phe Met Ala Met Leu Leu Lys Leu Arg 
225 230 235 240 

Gin Arg Val Leu Phe Leu Leu Asp Gly Tyr Asn Glu Phe Lys Pro Gin 

245 250 255 

Asn Cys Pro Glu He Glu Ala Leu He Lys Glu Asn His Arg Phe Lys 

260 265 270 

Asn Met Val lie Val Thr Thr Thr Thr Glu Cys Leu Arg His He Arg 

275 280 285 

Gin Phe Gly Ala Leu Thr Ala Glu Val Gly Asp Met Thr Glu Asp Ser 

290 295 300 

Ala Gin Ala Leu He Arg Glu Val Leu He Lys Glu Leu Ala Glu Gly 
305 310 315 320 

Leu Leu Leu Gin He Gin Lys Ser Arg Cys Leu Arg Asn Leu Met Lys 

325 330 335 

Thr Pro Leu Phe Val Val He Thr Cys Ala He Gin Met Gly Glu Ser 

340 345 350 

Glu Phe His Ser His Thr Gin Thr Thr Leu Phe His Thr Phe Tyr Asp 

355 360 365 

Leu Leu He Gin Lys Asn Lys His Lys His Lys Gly Val Ala Ala Ser 

370 375 380 

Asp Phe He Arg Ser Leu Asp His Arg Gly Asp Leu Ala Leu Glu Gly 
385 390 395 400 

Val Phe Ser His Lys Phe Asp Phe Glu Leu Gin Asp Val Ser Ser Val 

405 410 415 

Asn Glu Asp Val Leu Leu Thr Thr Gly Leu Leu Cys Lys Tyr Thr Ala 

420 425 430 

Gin Arg Phe Lys Pro Lys Tyr Lys Phe Phe His Lys Ser Phe Gin Glu 

435 440 445 

Tyr Thr Ala Gly Arg Arg Leu Ser Ser Leu Leu Thr Ser His Glu Pro 

450 455 460 

Glu Glu Val Thr Lys Gly Asn Gly Tyr Leu Gin Lys Met Val Ser He 
465 470 475 480 

Ser Asp He Thr Ser Thr Tyr Ser Ser Leu Leu Arg Tyr Thr Cys Gly 

485 490 495 

Ser Ser Val Glu Ala Thr Arg Ala Val Met Lys His Leu Ala Ala Val 

500 505 510 

Tyr Gin His Gly Cys Leu Leu Gly Leu Ser He Ala Lys Arg Pro Leu 
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515 520 525 

Trp Arg Gin Glu Ser Leu Gin Ser Val Lys Asn Thr Thr Glu Gin Glu 

530 535 540 

He Leu Lys Ala lie Asn He Asn Ser Phe Val Glu Cys Gly He His 
545 550 555 560 

Leu Tyr Gin Glu Ser Thr Ser Lys Ser Ala Leu Ser Gin Glu Phe Glu 

565 570 575 

Ala Phe Phe Gin Gly Lys Ser Leu Tyr He Asn Ser Gly Asn He Pro 

580 585 590 

Asp Tyr Leu Phe Asp Phe Phe Glu His Leu Pro Asn Cys Ala Ser Ala 

595 600 605 

Leu Asp Phe lie Lys Leu Asp Phe Tyr Gly Gly Ala Met Ala Ser Trp 

610 615 620 

Glu Lys Ala Ala Glu Asp Thr Gly Gly He His Met Glu Glu Ala Pro 
625 630 635 640 

Glu Thr Tyr He Pro Ser Arg Ala Val Ser Leu Phe Phe Asn Trp Lys 

645 650 655 

Gin Glu Phe Arg Thr Leu Glu Val Thr Leu Arg Asp Phe Ser Lys Leu 

660 665 670 

Asn Lys Gin Asp He Arg Tyr Leu Gly Lys He Phe Ser Ser Ala Thr 

675 680 685 

Ser Leu Arg Leu Gin He Lys Arg Cys Ala Gly Val Ala Gly Ser Leu 

690 695 700 

Ser Leu Val Leu Ser Thr Cys Lys Asn He Tyr Ser Leu Met Val Glu 
705 710 715 720 

Ala Ser Pro Leu Thr lie Glu Asp Glu Arg His He Thr Ser Val Thr 

725 730 735 

Asn Leu Lys Thr Leu Ser He His Asp Leu Gin Asn Gin Arg Leu Pro 

740 745 750 

Gly Gly Leu Thr Asp Ser Leu Gly Asn Leu Lys Asn Leu Thr Lys Leu 

755 760 765 

He Met Asp Asn He Lys Met Asn Glu Glu Asp Ala He Lys Leu Ala 

770 775 780 

Glu Gly Leu Lys Asn Leu Lys Lys Met Cys Leu Phe His Leu Thr His 
785 790 795 800 

Leu Ser Asp He Gly Glu Gly Met Asp Tyr He Val Lys Ser Leu Ser 

805 810 815 

Ser Glu Pro Cys Asp Leu Glu Glu He Gin Leu Val Ser Cys Cys Leu 

820 825 830 

Ser Ala Asn Ala Val Lys lie Leu Ala Gin Asn Leu His Asn Leu Val 

835 840 845 

Lys Leu Ser He Leu Asp Leu Ser Glu Asn Tyr Leu Glu Lys Asp Gly 

850 855 860 

Asn Glu Ala Leu His Glu Leu He Asp Arg Met Asn Val Leu Glu Gin 
865 870 875 880 

Leu Thr Ala Leu Met Leu Pro Trp Gly Cys Asp Val Gin Gly Ser Leu 

885 890 895 

Ser Ser Leu Leu Lys His Leu Glu Glu Val Pro Gin Leu Val Lys Leu 

900 905 910 

Gly Leu Lys Asn Trp Arg Leu Thr Asp Thr Glu lie Arg He Leu Gly 

915 920 925 

Ala Phe Phe Gly Lys Asn Pro Leu Lys Asn Phe Gin Gin Leu Asn Leu 

930 935 940 

Ala Gly Asn Arg Val Ser Ser Asp Gly Trp Leu Ala Phe Met Gly Val 
945 950 955 960 
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Phe Glu Asn Leu Lys Gin Leu Val Phe Phe Asp Phe Ser Thr Lys Glu 

965 970 975 

Phe Leu Pro Asp Pro Ala Leu Val Arg Lys Leu Ser Gin Val Leu Ser 

980 985 990 

Lys Leu Thr Phe Leu Gin Glu Ala Arg Leu Val Gly Trp Gin Phe Asp 

995 1000 1005 

Asp Asp Asp Leu Ser Val He Thr Gly Ala Phe Lys Leu Val Thr Ala 
1010 1015 1020 



<210> 98 

<211> 1395 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (277) . . . (1353) 



<400> 98 

cgcccgggca ggtgtttata ctccggaggg 
aaaactggtg atctgtttgc cctgtgtgac 
acatctgctg gaagtcctct gggattcaag 
aaaagaggac aatattggga tcacctttga 
gtgttataga aaggtgggaa gctttcatcc 



tgtccccgtg cgtcatcggt ggagtggacc 60 
cttgcccaga accctgctga ctgagagaac 120 
gtacagggaa tgaagagtag ttttacagaa 180 
cctttccatt tggaaataat attttctatt 240 
agaaca atg aat ttc ata aag gac 294 
Met Asn Phe He Lys Asp 
1 5 



aat age cga gec ctt att caa aga atg gga atg act gtt ata aag caa 342 
Asn Ser Arg Ala Leu He Gin Arg Met Gly Met Thr Val He Lys Gin 
10 15 20 

ate aca gat gac eta ttt gta tgg aat gtt ctg aat cgc gaa gaa gta 390 
He Thr Asp Asp Leu Phe Val Trp Asn Val Leu Asn Arg Glu Glu Val 
25 30 35 



aac ate att tgc tgc gag aag gtg gag cag gat get get aga ggg ate 438 
Asn He He Cys Cys Glu Lys Val Glu Gin Asp Ala Ala Arg Gly He 
40 45 50 



att cac atg att ttg aaa aag ggt tea gag tec tgt aac etc ttt ctt 486 
He His Met He Leu Lys Lys Gly Ser Glu Ser Cys Asn Leu Phe Leu 
55 60 65 70 



aaa tec ctt aag gag tgg aac tat cct eta ttt cag gac ttg aat gga 534 
Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu Phe Gin Asp Leu Asn Gly 
75 80 85 



caa agt ggt ctg act gac age ttg ggt aac ttg aag aac ctt aca aag 582 
Gin Ser Gly Leu Thr Asp Ser Leu Gly Asn Leu Lys Asn Leu Thr Lys 
90 95 100 



etc ata atg gat aac ata aag atg aat gaa gaa gat get ata aaa eta 630 
Leu He Met Asp Asn He Lys Met Asn Glu Glu Asp Ala He Lys Leu 
105 110 115 
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get gaa ggc ctg 
Ala Glu Gly Leu 
120 

cac ttg tct gac 
His Leu Ser Asp 
135 

tea agt gaa ccc 
Ser Ser Glu Pro 



ttg tct gca aat 
Leu Ser Ala Asn 
170 

gtc aaa ctg age 
Val Lys Leu Ser 
185 

gga aat gaa get 
Gly Asn Glu Ala 
200 

cag etc acc gca 
Gin Leu Thr Ala 
215 

ctg age age ctg 
Leu Ser Ser Leu 



ctt ggg ttg aaa 
Leu Gly Leu Lys 
250 

ggt gca ttt ttt 
Gly Ala Phe Phe 
265 

ttg gcg gga aat 
Leu Ala Gly Asn 

280 

gta ttt gag aat 
Val Phe Glu Asn 
295 

gaa ttt eta cct 
Glu Phe Leu Pro 



tec aag tta act 
Ser Lys Leu Thr 



aaa aac ctg 
Lys Asn Leu 
125 

att gga gag 
lie Gly Glu 
140 

tgt gac ctt 
Cys Asp Leu 
155 

gca gtg aaa 
Ala Val Lys 



att ctt gat 
lie Leu Asp 



ctt cat gaa 
Leu His Glu 
205 

ctg atg ctg 
Leu Met Leu 
220 

ttg aaa cat 
Leu Lys His 
235 

aac tgg aga 
Asn Trp Arg 



gga aag aac 
Gly Lys Asn 



cgt gtg age 
Arg Val Ser 
285 

ctt aag caa 
Leu Lys Gin 
300 

gat cca gca 
Asp Pro Ala 
315 

ttt ctg caa 
Phe Leu Gin 



aag aag atg tgt 
Lys Lys Met Cys 



gga atg gat tac 
Gly Met Asp Tyr 
145 

gaa gaa att caa 
Glu Glu lie Gin 
160 

ate eta get cag 
lie Leu Ala Gin 
175 

tta tea gaa aat 
Leu Ser Glu Asn 
190 

ctg ate gac agg 
Leu lie Asp Arg 



ccc tgg ggc tgt 
Pro Trp Gly Cys 
225 

ttg gag gag gtc 
Leu Glu Glu Val 
240 

etc aca gat aca 
Leu Thr Asp Thr 
255 

cct ctg aaa aac 
Pro Leu Lys Asn 
270 

agt gat gga tgg 
Ser Asp Gly Trp 



tta gtg ttt ttt 
Leu Val Phe Phe 
305 

tta gtc aga aaa 
Leu Val Arg Lys 
320 

gaa get agg ctt 
Glu Ala Arg Leu 



tta ttt cat 
Leu Phe Kis 
130 

ata gtc aag 
He Val Lys 



tta gtc tec 
Leu Val Ser 



aat ctt cac 
Asn Leu His 
180 

tac ctg gaa 
Tyr Leu Glu 
195 

atg aac gtg 
Met Asn Val 
210 

gac gtg caa 
Asp Val Gin 



cca caa etc 
Pro Gin Leu 



gag att aga 
Glu He Arg 
260 

ttc cag cag 
Phe Gin Gin 
275 

ctt gee ttc 
Leu Ala Phe 
290 

gac ttt agt 
Asp Phe Ser 



ctt age caa 
Leu Ser Gin 



gtt ggg tgg 
Val Gly Trp 



ttg acc 678 
Leu Thr 



tct ctg 726 
Ser Leu 
150 

tgc tgc 77 4 
Cys Cys 
165 

aat ttg 822 
Asn Leu 



aaa gat 870 
Lys Asp 



eta gaa 918 
Leu Glu 



ggc age 966 
Gly Ser 
230 

gtc aag 1014 

Val Lys 

245 

att tta 1062 
He Leu 



ttg aat 1110 
Leu Asn 



atg ggt 1158 
Met Gly 



act aaa 12 06 
Thr Lys 
310 

gtg tta 1254 

Val Leu 

325 

caa ttt 1302 
Gin Phe 
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330 



335 



340 



gat gat gat gat etc agt gtt att aca ggt get ttt aaa eta gta act 1350 
Asp Asp Asp Asp Leu Ser Val lie Thr Gly Ala Phe Lys Leu Val Thr 
345 350 355 

get taaataaagt gtactcgaag caaaaaaaaa aaaaaaaaaa aa 13 95 

Ala 



<210> 99 

<211> 359 

<212> PRT 

<213> Homo sapien 

<400> 99 

Met Asn Phe lie Lys Asp Asn Ser Arg Ala Leu lie Gin Arg Met Gly 

15 10 15 

Met Thr Val He Lys Gin He Thr Asp Asp Leu Phe Val Trp Asn Val 

20 25 30 

Leu Asn Arg Glu Glu Val Asn He He Cys Cys Glu Lys Val Glu Gin 

35 40 45 

Asp Ala Ala Arg Gly He He His Met He Leu Lys Lys Gly Ser Glu 

50 55 60 

Ser Cys Asn Leu Phe Leu Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu 
65 70 75 80 

Phe Gin Asp Leu Asn Gly Gin Ser Gly Leu Thr Asp Ser Leu Gly Asn 

85 90 95 

Leu Lys Asn Leu Thr Lys Leu He Met Asp Asn He Lys Met Asn Glu 

100 105 110 

Glu Asp Ala He Lys Leu Ala Glu Gly Leu Lys Asn Leu Lys Lys Met 

115 120 125 

Cys Leu Phe His Leu Thr His Leu Ser Asp He Gly Glu Gly Met Asp 

130 135 140 

Tyr He Val Lys Ser Leu Ser Ser Glu Pro Cys Asp Leu Glu Glu He 
145 150 155 160 

Gin Leu Val Ser Cys Cys Leu Ser Ala Asn Ala Val Lys He Leu Ala 

165 170 175 

Gin Asn Leu His Asn Leu Val Lys Leu Ser lie Leu Asp Leu Ser Glu 

180 185 190 

Asn Tyr Leu Glu Lys Asp Gly Asn Glu Ala Leu His Glu Leu He Asp 

195 200 205 

Arg Met Asn Val Leu Glu Gin Leu Thr Ala Leu Met Leu Pro Trp Gly 

210 215 220 

Cys Asp Val Gin Gly Ser Leu Ser Ser Leu Leu Lys His Leu Glu Glu 
225 230 235 240 

Val Pro Gin Leu Val Lys Leu Gly Leu Lys Asn Trp Arg Leu Thr Asp 

245 250 255 

Thr Glu He Arg He Leu Gly Ala Phe Phe Gly Lys Asn Pro Leu Lys 

260 265 270 

Asn Phe Gin Gin Leu Asn Leu Ala Gly Asn Arg Val Ser Ser Asp Gly 

275 280 285 

Trp Leu Ala Phe Met Gly Val Phe Glu Asn Leu Lys Gin Leu Val Phe 
290 295 300 
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Phe Asp Phe Ser Thr Lys Glu Phe Leu Pro Asp Pro Ala Leu Val Arg 
305 310 315 320 

Lys Leu Ser Gin Val Leu Ser Lys Leu Thr Phe Leu Gin Glu Ala Arg 

325 330 335 

Leu Val Gly Trp Gin Phe Asp Asp Asp Asp Leu Ser Val lie Thr Gly 

340 345 350 

Ala Phe Lys Leu Val Thr Ala 
355 



<210> 100 
<211> 578 
<212> DNA 
<213> Homo sapien 

<220> 
<221> CDS 

<222> (277) . . . (552) 
<400> 100 

cgcccgggca ggtgtttata ctccggaggg tgtccccgtg cgtcatcggt ggagtggacc SO 
aaaactggtg atctgtttgc cctgtgtgac cttgcccaga accctgctga ctgagagaac 120 
acatctgctg gaagtcctct gggattcaag gtacagggaa tgaagagtag ttttacagaa 180 
aaaagaggac aatattggga tcacctttga cctttccatt tggaaataat attttctatt 240 
gtgttataga aaggtgggaa gctttcatcc agaaca atg aat ttc ata aag gac 294 

Met Asn Phe lie Lys Asp 
1 5 

aat age cga gec ctt att caa aga atg gga atg act gtt ata aag caa 342 
Asn Ser Arg Ala Leu lie Gin Arg Met Gly Met Thr Val He Lys Gin 
10 15 20 

ate aca gat gac eta ttt gta tgg aat gtt ctg aat cgc gaa gaa gta 390 
lie Thr Asp Asp Leu Phe Val Trp Asn Val Leu Asn Arg Glu Glu Val 
25 30 35 

aac ate att tgc tgc gag aag gtg gag cag gat get get aga ggg ate 43 8 
Asn He He Cys Cys Glu Lys Val Glu Gin Asp Ala Ala Arg Gly He 
40 45 50 

att cac atg att ttg aaa aag ggt tea gag tec tgt aac etc ttt ctt 486 
He His Met He Leu Lys Lys Gly Ser Glu Ser Cys Asn Leu Phe Leu 
55 60 65 70 

aaa tec ctt aag gag tgg aac tat cct eta ttt cag gac ttg aat gga 534 
Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu Phe Gin Asp Leu Asn Gly 
75 80 85 

caa agt ctt tta aca get tagaaagtac agtagacata ctgggg 578 
Gin Ser Leu Leu Thr Ala 

90 



<210> 101 
<211> 92 
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<212> PRT 

<213> Homo sapien 



<400> 101 

Met Asn Phe lie Lys Asp Asn Ser 

1 5 
Met Thr Val He Lys Gin lie Thr 
20 

Leu Asn Arg Glu Glu Val Asn He 

35 40 
Asp Ala Ala Arg Gly He He His 

50 55 
Ser Cys Asn Leu Phe Leu Lys Ser 
65 70 
Phe Gin Asp Leu Asn Gly Gin Ser 
85 



Arg Ala Leu He Gin Arg Met Gly 

10 15 
Asp Asp Leu Phe Val Trp Asn Val 
25 30 
He Cys Cys Glu Lys Val Glu Gin 
45 

Met He Leu Lys Lys Gly Ser Glu 
60 

Leu Lys Glu Trp Asn Tyr Pro Leu 

75 80 
Leu Leu Thr Ala 
90 



<210> 102 
<211> 768 
<212> DNA 
<213> Homo sapien 

<220> 
<221> CDS 

<222> {277) . . . (744) 



<400> 102 

cgcccgggca ggtgtttata ctccggaggg 
aaaactggtg atctgtttgc cctgtgtgac 
acatctgctg gaagtcctct gggattcaag 
aaaagaggac aatattggga tcacctttga 
gtgttataga aaggtgggaa gctttcatcc 



tgtccccgtg cgtcatcggt ggagtggacc 6 0 
cttgcccaga accctgctga ctgagagaac 120 
gtacagggaa tgaagagtag ttttacagaa 18 0 
cctttccatt tggaaataat attttctatt 240 
agaaca atg aat ttc ata aag gac 294 
Met Asn Phe He Lys Asp 
1 5 



aat age cga gec ctt att caa aga atg gga atg act gtt ata aag caa 342 
Asn Ser Arg Ala Leu He Gin Arg Met Gly Met Thr Val He Lys Gin 
10 15 20 



ate aca gat gac eta ttt gta tgg aat gtt ctg aat cgc gaa gaa gta 390 
He Thr Asp Asp Leu Phe Val Trp Asn Val Leu Asn Arg Glu Glu Val 
25 30 35 



aac ate att tgc tgc gag aag gtg gag cag gat get get aga ggg ate 438 
Asn He He Cys Cys Glu Lys Val Glu Gin Asp Ala Ala Arg Gly He 
40 45 50 



att cac atg att ttg aaa aag ggt tea gag tec tgt aac etc ttt ctt 486 
He His Met He Leu Lys Lys Gly Ser Glu Ser Cys Asn Leu Phe Leu 
55 60 65 70 



aaa tec ctt aag gag tgg aac tat cct eta ttt cag gac ttg aat gga 534 
Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu Phe Gin Asp Leu Asn Gly 
75 80 85 
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caa agt ctt ttt cat cag aca tea gaa gga gac ttg gac gat ttg get 582 
Gin Ser Leu Phe His Gin Thr Ser Glu Giy Asp Lea Asp Asp Leu Ala 
90 95 100 

cag gat tta aag gac ttg tac cat ace cca tct ttt ctg aac ttt tat 63 0 
Gin Asp Leu Lys Asp Leu Tyr His Thr Pro Ser Phe Leu Asn Phe Tyr 
105 110 115 

ccc ctt ggt gaa gat att gac att att ttt aac ttg aaa age acc ttc 678 
Pro Leu Gly Glu Asp lie Asp lie lie Phe Asn Leu Lys Ser Thr Phe 
120 125 130 

aca gaa cct gtc ctg tgg agg aag gac caa cac cat cac cgc gtg gag 726 
Thr Glu Pro Val Leu Trp Arg Lys Asp Gin His His His Arg Val Glu 
135 140 145 150 

cag ctg acc eta gtt tta tagcatcttc tacctgcccg ggcg 768 
Gin Leu Thr Leu Val Leu 
155 



<210> 103 
<211> 156 
<212> PRT 
<213> Homo sapien 

<400> 103 

Met Asn Phe lie Lys Asp Asn Ser 

1 5 
Met Thr Val He Lys Gin He Thr 
20 

Leu Asn Arg Glu Glu Val Asn He 

35 40 
Asp Ala Ala Arg Gly He He His 

50 55 
Ser Cys Asn Leu Phe Leu Lys Ser 
65 70 
Phe Gin Asp Leu Asn Gly Gin Ser 
85 

Asp Leu Asp Asp Leu Ala Gin Asp 
100 

Ser Phe Leu Asn Phe Tyr Pro Leu 
115 120 
Asn Leu Lys Ser Thr Phe Thr Glu 

130 135 
His His His Arg Val Glu Gin Leu 
145 150 



Arg Ala 


Leu 


He 


Gin Arg Met 


Gly 


10 






15 




Asp Asp 


Leu 


Phe 


Val Trp Asn 


Val 


25 






30 




He Cys 


Cys 


Glu 


Lys Val Glu 


Gin 








45 




Met He 


Leu 


Lys 


Lys Gly Ser 


Glu 






60 






Leu Lys 


Glu 


Trp 


Asn Tyr Pro 


Leu 




75 






80 


Leu Phe 


His 


Gin 


Thr Ser Glu 


Gly 


90 






95 




Leu Lys 


Asp 


Leu 


Tyr His Thr 


Pro 


105 






110 




Gly Glu 


Asp 


He 


Asp He He 


Phe 








125 




Pro Val 


Leu 


Trp 


Arg Lys Asp 


Gin 






140 






Thr Leu 


Val 


Leu 







155 



<210> 104 

<211> 24 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Primer 
<400> 104 

aagaagagac ggctgcttat caat 24 

<210> 105 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 105 

ccacagcagg cctcgaagat gate 24 

<210> 106 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 106 

atgatcctcc tgaagaagag 2 0 

<210> 107 

<211> 1009 

<212> PRT 

<213> Homo sapien 

<400> 107 

Cys Glu Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu 

15 10 15 

Val Glu Leu Leu Val Ser Gly Ser Leu Glu Gly Phe Glu Ser Val Leu 

20 25 30 

Asp Trp Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly 

35 40 45 

Phe His Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu 

50 55 60 

Asp Thr Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu lie Ala 
65 70 75 80 

Ala Ala Gin Glu Ala Gin Ala Asp Ser Gin Ser Pro Lys Leu His Gly 

85 90 95 

Cys Trp Asp Pro His Ser Leu His Pro Ala Arg Asp Leu Gin Ser His 

100 105 110 

Arg Pro Ala He Val Arg Arg Leu His Ser His Val Glu Asn Met Leu 

115 120 125 

Asp Leu Ala Trp Glu Arg Gly Phe Val Ser Gin Tyr Glu Cys Asp Glu 

130 135 140 

He Arg Leu Pro He Phe Thr Pro Ser Gin Arg Ala Arg Arg Leu Leu 
145 150 155 160 

Asp Leu Ala Thr Val Lys Ala Asn Gly Leu Ala Ala Phe Leu Leu Gin 
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165 170 175 

His Val Gin Glu Leu Pro Val Pro Leu Ala Leu Pro Leu Glu Ala Ala 

180 185 190 

Thr Cys Lys Lys Tyr Met Ala Lys Leu Arg Thr Thr Val Ser Ala Gin 

195 200 205 

Ser Arg Phe Leu Ser Thr Tyr Asp Gly Ala Glu Thr Leu Cys Leu Glu 

210 215 220 

Asp lie Tyr Thr Glu Asn Val Leu Glu Val Trp Ala Asp Val Gly Met 
225 230 235 240 

Ala Gly Pro Pro Gin Lys Ser Pro Ala Thr Leu Gly Leu Glu Glu Leu 

245 250 255 

Phe Ser Thr Pro Gly His Leu Asn Asp Asp Ala Asp Thr Vai Leu Val 

260 265 270 

Val Gly Glu Ala Gly Ser Gly Lys Ser Thr Leu Leu Gin Arg Leu His 

275 280 285 

Leu Leu Trp Ala Ala Gly Gin Asp Phe Gin Glu Phe Leu Phe Val Phe 

290 295 300 

Pro Phe Ser Cys Arg Gin Leu Gin Cys Met Ala Lys Pro Leu Ser Val 
305 310 315 320 

Arg Thr Leu Leu Phe Glu His Cys Cys Trp Pro Asp Val Gly Gin Glu 

325 330 335 

Asp He Phe Gin Leu Leu Leu Asp His Pro Asp Arg Val Leu Leu Thr 

340 345 350 

Phe Asp Gly Phe Asp Glu Phe Lys Phe Arg Phe Thr Asp Arg Glu Arg 

355 360 365 

His Cys Ser Pro Thr Asp Pro Thr Ser Val Gin Thr Leu Leu Phe Asn 

370 375 380 

Leu Leu Gin Gly Asn Leu Leu Lys Asn Ala Arg Lys Val Val Thr Ser 
385 390 395 400 

Arg Pro Ala Ala Val Ser Ala Phe Leu Arg Lys Tyr He Arg Thr Glu 

405 410 415 

Phe Asn Leu Lys Gly Phe Ser Glu Gin Gly He Glu Leu Tyr Leu Arg 

420 425 430 

Lys Arg His His Glu Pro Gly Val Ala Asp Arg Leu He Arg Leu Leu 

435 440 445 

Gin Glu Thr Ser Ala Leu His Gly Leu Cys His Leu Pro Val Phe Ser 

450 455 460 

Trp Met Val Ser Lys Cys His Gin Glu Leu Leu Leu Gin Glu Gly Gly 
465 470 475 480 

Ser Pro Lys Thr Thr Thr Asp Met Tyr Leu Leu He Leu Gin His Phe 

485 490 495 

Leu Leu His Ala Thr Pro Pro Asp Ser Ala Ser Gin Gly Leu Gly Pro 

500 505 510 

Ser Leu Leu Arg Gly Arg Leu Pro Thr Leu Leu His Leu Gly Arg Leu 

515 520 525 

Ala Leu Trp Gly Leu Gly Met Cys Cys Tyr Val Phe Ser Ala Gin Gin 

530 535 540 

Leu Gin Ala Ala Gin Val Ser Pro Asp Asp He Ser Leu Gly Phe Leu 
545 550 555 560 

Val Arg Ala Lys Gly Val Val Pro Gly Ser Thr Ala Pro Leu Glu Phe 

565 570 575 

Leu His He Thr Phe Gin Cys Phe Phe Ala Ala Phe Tyr Leu Ala Leu 

580 585 590 

Ser Ala Asp Val Pro Pro Ala Leu Leu Arg His Leu Phe Asn Cys Gly 
595 600 605 
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Arg Pro Gly Asn Ser Pro Met Ala Arg Leu Leu Pro Thr Met Cys He 

610 615 620 

Gin Ala Ser Glu Gly Lys Asp Ser Ser Val Ala Ala Leu Leu Gin Lys 
625 630 635 640 

Ala Glu Pro His Asn Leu Gin He Thr Ala Ala Phe Leu Ala Gly Leu 

645 650 655 

Leu Ser Arg Glu His Trp Gly Leu Leu Ala Glu Cys Gin Thr Ser Glu 

660 665 670 

Lys Ala Leu Leu Arg Arg Gin Ala Cys Ala Arg Trp Cys Leu Ala Arg 

675 680 685 

Ser Leu Arg Lys His Phe His Ser He Pro Pro Ala Ala Pro Gly Glu 

690 695 700 

Ala Lys Ser Val His Ala Met Pro Gly Phe He Trp Leu He Arg Ser 
705 710 715 720 

Leu Tyr Glu Met Gin Glu Glu Arg Leu Ala Arg Lys Ala Ala Arg Gly 

725 730 735 

Leu Asn Val Gly His Leu Lys Leu Thr Phe Cys Ser Val Gly Pro Thr 

740 745 750 

Glu Cys Ala Ala Leu Ala Phe Val Leu Gin His Leu Arg Arg Pro Val 

755 760 765 

Ala Leu Gin Leu Asp Tyr Asn Ser Val Gly Asp He Gly Val Glu Gin 

770 775 780 

Leu Leu Pro Cys Leu Gly Val Cys Lys Ala Leu Tyr Leu Arg Asp Asn 
785 790 795 800 . 

Asn He Ser Asp Arg Gly He Cys Lys Leu He Glu Cys Ala Leu His 

805 810 815 

Cys Glu Gin Leu Gin Lys Leu Ala Leu Gly Asn Asn Tyr He Thr Ala 

820 825 830 

Ala Gly Ala Gin Val Leu Ala Glu Gly Leu Arg Gly Asn Thr Ser Leu 

835 840 845 

Gin Phe Leu Gly Phe Trp Gly Asn Arg Val Gly Asp Glu Gly Ala Gin 

850 855 860 

Ala Leu Ala Glu Ala Leu Gly Asp His Gin Ser Leu Arg Trp Leu Ser 
865 870 875 880 

Leu Val Gly Asn Asn He Gly Ser Val Gly Ala Gin Ala Leu Ala Leu 

885 890 895 

Met Leu Ala Lys Asn Val Met Leu Glu Glu Leu Cys Leu Glu Glu Asn 

900 905 910 

His Leu Gin Asp Glu Gly Val Cys Ser Leu Ala Glu Gly Leu Lys Lys 

915 920 925 

Asn Ser Ser Leu Lys He Leu Asn He Lys He His Ala Ser Gly Phe 

930 935 940 

Asn Lys Leu Leu Glu Ser He Phe Cys He Leu Leu Val Val Glu Ala 
945 950 955 960 

Phe Phe Leu Gin Lys Val Val Lys He Leu Glu Glu Met Val Val Ser 

965 970 975 

Trp Leu Glu Val Arg Leu Ser Asn Asn Cys He Thr Tyr Leu Gly Ala 

980 985 990 

Glu Ala Leu Leu Gin Ala Leu Glu Arg Asn Asp Thr He Leu Glu Val 
995 1000 1005 

Trp 



<210> 108 
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<400> 108 
000 

<210> 109 

<400> 109 
000 

<210> 110 

<400> 110 
000 

<210> 111 

<400> 111 
000 

<210> 112 

<220> 

<221> VARIANT 

<222> (1) . . . (87) 

<223> Xaa = Any Amino Acid 

<400> 112 
000 

<210> 113 

<400> 113 
000 

<210> 114 

<400> 114 
000 

<210> 115 

<400> 115 
000 

<210> 116 

<400> 116 
000 

<210> 117 

<400> 117 
000 

<210> 118 
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<400> 118 
000 

<210> 119 

<400> 119 
000 

<210> 120 

<400> 120 
000 

<210> 121 

<400> 121 
000 

<210> 122 

<400> 122 
000 

<210> 123 

<400> 123 
000 

<210> 124 

<400> 124 
000 

<210> 125 

<400> 125 
000 

<210> 126 

<400> 126 
000 

<210> 127 

<400> 127 
000 

<210> 128 

<400> 128 
000 

<210> 129 
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<400> 129 
000 

<210> 130 

<400> 130 
000 

<210> 131 

<400> 131 
000 

<210> 132 

<400> 132 
000 

<210> 133 

<400> 133 
000 

<210> 134 

<400> 134 
000 

<210> 135 

<400> 135 
000 

<210> 136 

<400> 136 
000 

<210> 137 

<400> 137 
000 

<210> 138 

<400> 138 
000 

<210> 139 

<400> 139 
000 

<210> 140 
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<400> 140 
000 

<210> 141 

<400> 141 
000 

<210> 142 

<400> 142 
000 

<210> 143 

<400> 143 
000 

<210> 144 • 

<400> 144 
000 

<210> 145 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 145 

ccagaattca tggccgacaa ggtcctgaag 3 0 

<210> 146 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 146 

ccactcgagc taatttccag gtatcggacc 30 

<210> 147 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 147 
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gaagacagtt acctggcaga 

<210> 148 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 148 

ttgtattctg aacatggcac c 

<210> 149 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> primer 
<400> 149 

gatcatcatc caggccgccc gtggtgacag ccctgg 

<210> 150 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 150 

ccagggctgt caccacgggc ggcctggatg atgatc 

<210> 151 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 151 

cggaattcat ggccgacaag gtcctg 

<210> 152 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 152 
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cgctcgagtt agtcttgcat attaaggtaa tttccaga 38 

<210> 153 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 153 

catgtgaatg atccctctag cag 23 

<210> 154 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 154 

gggctcggct atcgtgctct a 21 

<210> 155 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 155 

acgatagccg agcccttatt c 21 

<210> 156 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 156 

gtatggaatg ttctgaatcg c 21 

<210> 157 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 157 
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cccggatcca tgaatttcat aaaggacaat age 33 



<2i0> 158 
<21X> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 158 

cccttcgaac aagtcctgaa atagaggata 

<210> 159 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> primer 
<400> 159 

ggtggagcag gatgetgeta gagg 24 

<210> 160 
<2U> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 160 

cacagtggtc caggctccga atgaagtca 2 9 

<210> 161 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 161 

catcatttgc tgegagaagg tggag 25 

<210> 162 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 162 
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ttaacttgga taacacttgg ctaag 25 

<210> 163 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 163 

gtaaacatca tttgctgcga gaa 23 

<210> 164 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 164 

cccgggcagg tagaagatgc tat 23 

<210> 165 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 
<400> 165 

aatttcataa aggacaatag ccgag 25 

<210> 166 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 166 

tgtctactgt actttctaag ctgtt 25 

<210> 167 
<211> 225 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (225) 
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<400> 167 

gag agt act ccc tea gag ate ata gaa aga gaa aga aaa aag ttg ctt 48 

Glu Ser Thr Pro Ser GIu He He Glu Arg Glu Arg Lys Lys Leu Leu 
15 10 15 



gaa ate ctt caa cat gat cct gat tct ate tta gac acg tta act tct 96 
Glu He Leu Gin His Asp Pro Asp Ser He Leu Asp Thr Leu Thr Ser 
20 25 30 



egg agg ctg att tct gag gaa gag tat gag act ctg gag aat gtt aca 144 
Arg Arg Leu He Ser Glu Glu Glu Tyr Glu Thr Leu Glu Asn Val Thr 
35 40 45 



gat etc ctg aag aaa agt egg aag ctg tta att ttg gta cag aaa aag 192 
Asp Leu Leu Lys Lys Ser Arg Lys Leu Leu lie Leu Val Gin Lys Lys 
50 55 60 



gga gag gcg ace tgt cag cat ttt etc aag tgt 225 
Gly Glu Ala Thr Cys Gin His Phe Leu Lys Cys 
65 70 75 



<210> 168 

<211> 75 

<212> PRT 

<213> Homo sapiens 



<400> 168 

Glu Ser Thr Pro Ser Glu He He 

1 5 
Glu He Leu Gin His Asp Pro Asp 
20 

Arg Arg Leu He Ser Glu Glu Glu 

35 40 
Asp Leu Leu Lys Lys Ser Arg Lys 

50 55 
Gly Glu Ala Thr Cys Gin His Phe 
65 70 



Glu Arg Glu Arg Lys Lys Leu Leu 

10 15 
Ser He Leu Asp Thr- Leu Thr Ser 
25 30 
Tyr Glu Thr Leu Glu Asn Val Thr 
45 

Leu Leu He Leu Val Gin Lys Lys 
60 

Leu Lys Cys 
75 



<210> 169 
<211> 228 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (228) 



<400> 169 

atg tgc teg cag gag get ttt cag gca cag agg age cag ctg gtc gag 48 

Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu Val Glu 
15 10 15 



ctg ctg gtc tea ggg tec ctg gaa ggc ttc gag agt gtc ctg gac tgg 96 
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Leu Leu Val Ser Gly Ser Leu Glu Gly Phe GIu Ser Val Leu Asp Trp 
20 25 30 

ctg ctg tec tgg gag gtc etc tec tgg gag gac tac gag ggc ttc cac 144 
Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly Phe His 
35 40 45 

etc ctg ggc cag cct etc tec cac ttg gec agg cgc ctt ctg gac ace 192 
Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu Asp Thr 
50 55 60 

gtc tgg aat aag ggt act tgg gec tgt cag aag etc 228 
Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu 
65 70 75 



<210> 170 
<211> 76 
<212> PRT 

<213> Homo sapiens 
<400> 170 

Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu Val Glu 

15 10 15 

Leu Leu Val, Ser Gly Ser Leu Glu Gly Phe Glu Ser Val Leu Asp Trp 

20 25 30 

Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly Phe His 

35 40 45 

Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu Asp Thr 

50 55 60 

Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu 
65 70 75 



<210> 171 
<211> 243 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (243) 

<400> 171 

cca gee cga gac ctg cag agt cac egg cca gec att gtc agg agg etc 48 
Pro Ala Arg Asp Leu Gin Ser His Arg Pro Ala He Val Arg Arg Leu 
15 10 15 

cac age cat gtg gag aac atg ctg gac ctg gca tgg gag egg ggt ttc 96 
His Ser His Val Glu Asn Met Leu Asp Leu Ala Trp Glu Arg Gly Phe 
20 25 30 

gtc age cag tat gaa tgt gat gaa ate agg ttg ccg ate ttc aca ccg 144 
Val Ser Gin Tyr Glu Cys Asp Glu He Arg Leu Pro He Phe Thr Pro 
35 40 45 
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tec cag agg gca aga agg ctg ctt gat 
Ser Gin Arg Ala Arg Arg Leu Leu Asp 
50 55 

gga ttg get gec tte ctt eta caa cat 
Gly Leu Ala Ala Phe Leu Leu Gin His 
65 70 

ttg 
Leu 



ctt gec acg gtg aaa gcg aat 192 
Leu Ala Thr Val Lys Ala Asn 
60 

gtt cag gaa tta cca gtc cca 240 
Val Gin Glu Leu Pro Val Pro 
75 80 

243 



<210> 172 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<400> 172 

Pro Ala Arg Asp Leu Gin Ser His 

1 5 
His Ser His Val Glu Asn Met Leu 

20 

Val Ser Gin Tyr Glu Cys Asp Glu 

35 40 
Ser Gin Arg Ala Arg Arg Leu Leu 

50 55 
Gly Leu Ala Ala Phe Leu Leu Gin 
65 70 
Leu 



Arg Pro Ala He Val Arg Arg Leu 

10 15 
Asp Leu Ala Trp Glu Arg Gly Phe 
25 30 
He Arg Leu Pro He Phe Thr Pro 
45 

Asp Leu Ala Thr Val Lys Ala Asn 
60 

His Val Gin Glu Leu Pro Val Pro 
75 80 



<210> 173 
<211> 888 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (888) 

<400> 173 

gac gat gcg gac act gtg ctg gtg gtg ggt gag gcg ggc agt ggc aag 4 8 

Asp Asp Ala Asp Thr Val Leu Val Val Gly Glu Ala Gly Ser Gly Lys 

15 10 15 



age acg etc ctg cag egg ctg cac ttg ctg tgg get gca ggg caa gac 96 
Ser Thr Leu Leu Gin Arg Leu His Leu Leu Trp Ala Ala Gly Gin Asp 
20 25 30 

ttc cag gaa ttt etc ttt gtc tte cca ttc age tgc egg cag ctg cag 144 
Phe Gin Glu Phe Leu Phe Val Phe Pro Phe Ser Cys Arg Gin Leu Gin 
35 40 45 
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tgc atg gcc aaa cca etc tct gtg egg act eta etc ttt gag cac tgc 192 
Cys Met Ala Lys Pro Leu Ser Val Arg Thr Leu Leu Phe Glu His Cys 
50 55 60 

tgt tgg cct gat gtt ggt caa gaa gac ate ttc cag tta etc ctt gac 240 
Cys Trp Pro Asp Val Gly Gin Glu Asp lie Phe Gin Leu Leu Leu Asp 
65 70 75 80 

cac cct gac cgt gtc ctg tta acc ttt gat ggc ttt gac gag ttc aag 288 
His Pro Asp Arg Val Leu Leu Thr Phe Asp Gly Phe Asp Glu Phe Lys 
85 90 95 

ttc agg ttc acg gat cgt gaa cgc cac tgc tec ccg acc gac ccc acc 336 
Phe Arg Phe Thr Asp Arg Glu Arg His Cys Ser Pro Thr Asp Pro Thr 
100 105 110 

tct gtc cag acc ctg etc ttc aac ctt ctg cag ggc aac ctg ctg aag 384 
Ser Val Gin Thr Leu Leu Phe Asn Leu Leu Gin Gly Asn Leu Leu Lys 
115 120 125 

aat gcc cgc aag gtg gtg acc age cgt ccg gcc get gtg teg gcg ttc 432 
Asn Ala Arg Lys Val Val Thr Ser Arg Pro Ala Ala Val Ser Ala Phe 
130 135 140 

etc agg aag tac ate cgc acc gag ttc aac etc aag ggc ttc tct gaa 48 0 
Leu Arg Lys Tyr lie Arg Thr Glu Phe Asn Leu Lys Gly Phe Ser Glu 
145 150 155 160 

cag ggc ate gag ctg tac ctg agg aag cgc cat cat gag ccc ggg gtg 52 8 
Gin Gly lie Glu Leu Tyr Leu Arg Lys Arg His His Glu Pro Gly Val 
165 170 175 

gcg gac cgc etc ate cgc ctg etc caa gag acc tea gcc ctg cac ggt 576 
Ala Asp Arg Leu lie Arg Leu Leu Gin Glu Thr Ser Ala Leu His Gly 
180 185 190 

ttg tgc cac ctg cct gtc ttc tea tgg atg gtg tec aaa tgc cac cag 624 
Leu Cys His Leu Pro Val Phe Ser Trp Met Val Ser Lys Cys His Gin 
195 200 205 

gaa ctg ttg ctg cag gag ggg ggg tec cca aag acc act aca gat atg 672 
Glu Leu Leu Leu Gin Glu Gly Gly Ser Pro Lys Thr Thr Thr Asp Met 
210 215 220 

tac ctg ctg att ctg cag cat ttt ctg ctg cat gcc acc ccc cca gac 720 
Tyr Leu Leu lie Leu Gin His Phe Leu Leu His Ala Thr Pro Pro Asp 
225 230 235 240 

tea get tec caa ggt ctg gga ccc agt ctt ctt egg ggc cgc etc ccc 768 
Ser Ala Ser Gin Gly Leu Gly Pro Ser Leu Leu Arg Gly Arg Leu Pro 
245 250 255 

acc etc ctg cac ctg ggc aga ctg get ctg tgg ggc ctg ggc atg tgc 816 
Thr Leu Leu His Leu Gly Arg Leu Ala Leu Trp Gly Leu Gly Met Cys 
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260 265 270 



tgc tac gtg 


ttc tea gec cag 


cag 


etc 


cag 


gca 


gca 


cag 


gtc 


age 


cct 


Cys Tyr Val 


Phe Ser Ala Gin 


Gin 


Leu 


Gin 


Ala 


Ala 


Gin 


Val 


Ser 


Pro 


275 




280 










285 








gat gac att 


tct ctt ggc ttc 


ctg 


















Asp Asp He 


Ser Leu Gly Phe 


Leu 


















290 


295 




















<210> 174 






















<211> 296 






















<212> PRT 






















<213> Homo sapiens 




















<400> 174 






















Asp Asp Ala 


Asp Thr Val Leu 


Veil 


V ct -L 


Gly Glu Ala 


uiy 


Ser 


j- y 


Lys 


1 


5 






10 










_L -J 




Ser Thr Leu 


Leu Gin Arg Leu 


rllS 


Leu 


Leu 


Trp 


Ala 


Kid 


vjxy 


(jjin 


Asp 




20 




O £ 

A D 










i r> 
o u 






Phe Gin Glu 


Phe Leu Phe Val 


Pne 


Pro 


Phe 


Ser 


Cys 


Arg 


Gin 


Leu 


Gin 


35 




A C\ 

4 U 










a a 








Cys Met Ala 


Lys Pro Leu Ser 


Val 


Arg 


Thr 


Leu 


Leu 


Pne 


Glu 


Hrs 


Cys 


50 


55 










60 










Cys Trp Pro 


Asp Val Gly Gin 


G1U 


Asp 


He 


Phe 


Gin 


Leu 


Leu 


Leu 


Asp 


65 


70 








75 










o U 


His Pro Asp 


Arg Val Leu Leu 


Thr 


Phe 


Asp 


Gly 


Phe 


Asp 


(jIU 


rfie 


Lys 




85 






90 














Phe Arg Phe 


Thr Asp Arg Glu 


Arg 


His 


Cys 


Ser 


Pro 


Thr 


Asp 


Pro 


inr 




100 




105 










11U 






Ser Val Gin 


Thr Leu Leu Phe 


Asn 


Leu 


Leu 


Gin 


Gly 


Asn 


Leu 


Leu. 


Lys 


115 




12 0 










125 








Asn Ala Arg 


Lys Val Val Thr 


OCX 


Arg 


Pro 


Ala 


Ala 


V cl X 


O Ci 


7\ 1 ^ 


rr Xlc 


130 


135 










140 










Leu Arg Lys 


Tyr He Arg Thr 


Glu 


Phe 


Asn 


Leu 


Lys 


Gly 


Phe 


Ser 


Glu 


145 


150 








155 










160 


Gin Gly He 


Glu Leu Tyr Leu 


Arg 


Lys Arg His 


His 


Glu 


Pro 


Glv 


Val 




165 






170 










175 




Ala Asp Arg 


Leu He Arg Leu 


Leu 


Gin 


Glu 


Thr 


Ser 


Ala 


Leu 


His 


Glv 




180 




185 










190 






Leu Cys His 


Leu Pro Val Phe 


Ser 


Trp Met Val 


Ser 


Lys 


Cys 


His 


Gin 


195 




200 










205 








Glu Leu Leu 


Leu Gin Glu Gly 


Gly 


Ser 


Pro 


Lys 


Thr 


Thr 


Thr 


Asp 


Met 


210 


215 










220 










Tyr Leu Leu 


He Leu Gin His 


Phe 


Leu 


Leu 


His 


Ala 


Thr 


Pro 


Pro 


Asp 


225 


230 








235 










240 


Ser Ala Ser 


Gin Gly Leu Gly 


Pro 


Ser Leu Leu Arg 


Gly 


Arg 


Leu 


Pro 




245 






250 










255 




Thr Leu Leu 


His Leu Gly Arg 


Leu 


Ala Leu Trp Gly 


Leu 


Gly 


Met 


Cys 




260 




265 










270 






Cys Tyr Val 


Phe Ser Ala Gin 


Gin 


Leu 


Gin 


Ala 


Ala 


Gin 


Val 


Ser 


Pro 


275 




280 










285 








Asp Asp He 


Ser Leu Gly Phe 


Leu 



















290 295 
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<210> 175 

<211> 1209 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . , (1209) 

<400> 175 

gag ccc ggg gtg gcg gac cgc etc ate cgc ctg etc caa gag acc tea 48 
Glu Pro Gly Val Ala Asp Arg Leu He Arg Leu Leu Gin Glu Thr Ser 
IB 10 15 

gee ctg cac ggt ttg tgc cac ctg cct gtc ttc tea tgg atg gtg tec 96 
Ala Leu His Gly Leu Cys His Leu Pro Val Phe Ser Trp Met Val Ser 
20 25 30 

aaa tgc cac cag gaa ctg ttg ctg cag gag ggg ggg tec cca aag acc 144 
Lys Cys His Gin Glu Leu Leu Leu Gin Glu Gly Gly Ser Pro Lys Thr 
35 40 45 

act aca gat atg tac ctg ctg att ctg cag cat ttt ctg ctg cat gee 192 
Thr Thr Asp Met Tyr Leu Leu He Leu Gin His Phe Leu Leu His Ala 
50 55 60 

acc ccc cca gac tea get tec caa ggt ctg gga ccc agt ctt ctt egg 24 0 
Thr Pro Pro Asp Ser Ala Ser Gin Gly Leu Gly Pro Ser Leu Leu Arg 
65 70 75 80 

gge cgc etc ccc acc etc ctg cac ctg ggc aga ctg get ctg tgg ggc 2 88 
Gly Arg Leu Pro Thr Leu Leu His Leu Gly Arg Leu Ala Leu Trp Gly 
85 90 95 

ctg ggc atg tgc tgc tac gtg ttc tea gee cag cag etc cag gca gca 336 
Leu Gly Met Cys Cys Tyr Val Phe Ser Ala Gin Gin Leu Gin Ala Ala 
100 105 110 

cag gtc age cct gat gac att tct ctt ggc ttc ctg gtg cgt gee aaa 3 84 
Gin Val Ser Pro Asp Asp He Ser Leu Gly Phe Leu Val Arg Ala Lys 
115 120 125 

ggt gtc gtg cca ggg agt acg gcg ccc ctg gaa ttc ctt cac ate act 432 
Gly Val Val Pro Gly Ser Thr Ala Pro Leu Glu Phe Leu His He Thr 
130 135 140 

ttc cag tgc ttc ttt gec gcg ttc tac ctg. gca etc agt get gat gtg 480 
Phe Gin Cys Phe Phe Ala Ala Phe Tyr Leu Ala Leu Ser Ala Asp Val 
145 150 155 160 

cca cca get ttg etc aga cac etc ttc aat tgt ggc agg cca ggc aac 528 
Pro Pro Ala Leu Leu Arg His Leu Phe Asn Cys Gly Arg Pro Gly Asn 
165 170 175 
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tea cca atg gec agg etc ctg ccc acg atg tgc ate cag gec teg gag 576 
Ser Pro Met Ala Arg Leu Leu Pro Thr Met Cys He Gin Ala Ser Glu 
180 185 190 

gga aag gac age age gtg gca get ttg ctg cag aag gec gag ccg cac 624 
Gly Lys Asp Ser Ser Val Ala Ala Leu Leu Gin Lys Ala Glu Pro His 
195 200 205 

aac ctt cag ate aca gca gee ttc ctg gca ggg ctg ttg tec egg gag 672 
Asn Leu Gin He Thr Ala Ala Phe Leu Ala Gly Leu Leu Ser Arg Glu 
210 215 220 

cac tgg ggc ctg ctg get gag tgc cag aca tct gag aag gec ctg etc 720 
His Trp Gly Leu Leu Ala Glu Cys Gin Thr Ser Glu Lys Ala Leu Leu 
225 230 235 240 

c 99" c 9c cag gec tgt gec cgc tgg tgt ctg gec cgc age etc cgc aag 768 
Arg Arg Gin Ala Cys Ala Arg Trp Cys Leu Ala Arg Ser Leu Arg Lys 
245 250 255 

cac ttc cac tec ate ccg cca get gca ccg ggt gag gec aag age gtg 816 
His Phe His Ser He Pro Pro Ala Ala Pro Gly Glu Ala Lys Ser Val 
260 265 270 

cat gee atg ccc ggg ttc ate tgg etc ate egg age ctg tac gag atg 864 
His Ala Met Pro Gly Phe He Trp Leu He Arg Ser Leu Tyr Glu Met 
275 280 285 

cag gag gag egg ctg get egg aag get gca cgt ggc ctg aat gtt ggg 912 
Gin Glu Glu Arg Leu Ala Arg Lys Ala Ala Arg Gly Leu Asn Val Gly 
290 295 300 

cac etc aag ttg aca ttt tgc agt gtg ggc ccc act gag tgt get gec 960 
His Leu Lys Leu Thr Phe Cys Ser Val Gly Pro Thr Glu Cys Ala Ala 
305 310 315 320 

ctg gec ttt gtg ctg cag cac etc egg egg ccc gtg gec ctg cag ctg 1008 
Leu Ala Phe Val Leu Gin His Leu Arg Arg Pro Val Ala Leu Gin Leu 
325 330 335 

gac tac aac tct gtg ggt gac att ggc gtg gag cag ctg ctg cct tgc 1056 
Asp Tyr Asn Ser Val Gly Asp He Gly Val Glu Gin Leu Leu Pro Cys 
340 345 350 

ctt ggt gtc tgc aag get ctg tat ttg cgc gat aac aat ate tea gac 1104 
Leu Gly Val Cys Lys Ala Leu Tyr Leu Arg Asp Asn Asn He Ser Asp 
355 360 365 

cga ggc ate tgc aag etc att gaa tgt get ctt cac tgc gag caa ttg 1152 
Arg Gly He Cys Lys Leu He Glu Cys Ala Leu His Cys Glu Gin Leu 
370 375 380 

cag aag tta gcg ctg ggg aat aac tac ate act gec gcg gga gec caa 12 00 
Gin Lys Leu Ala Leu Gly Asn Asn Tyr He Thr Ala Ala Gly Ala Gin 
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385 



390 



395 



400 



gtg ctg gcc 



1209 



Val Leu Ala 



<210> 176 
<211> 403 
<212> PRT 

<213> Homo sapiens 
<400> 176 

Glu Pro Gly Val Ala Asp Arg Leu lie Arg Leu Leu Gin Glu Thr Ser 

15 10 15 

Ala Leu His Gly Leu Cys His Leu Pro Val Phe Ser Trp Met Val Ser 

20 25 30 

Lys Cys His Gin Glu Leu Leu Leu Gin Glu Gly Gly Ser Pro Lys Thr 

35 40 45 

Thr Thr Asp Met Tyr Leu Leu He Leu Gin His Phe Leu Leu His Ala 

50 55 60 

Thr Pro Pro Asp Ser Ala Ser Gin Gly Leu Gly Pro Ser Leu Leu Arg 
65 70 75 80 

Gly Arg Leu Pro Thr Leu Leu His Leu Gly Arg Leu Ala Leu Trp Gly 

85 90 95 

Leu Gly Met Cys Cys Tyr Val Phe Ser Ala Gin Gin Leu Gin Ala Ala 

100 105 110 

Gin Val Ser Pro Asp Asp He Ser Leu Gly Phe Leu Val Arg Ala Lys 

115 120 125 

Gly Val Val Pro Gly Ser Thr Ala Pro Leu Glu Phe Leu His He Thr 

130 135 140 

Phe Gin Cys Phe Phe Ala Ala Phe Tyr Leu Ala Leu Ser Ala Asp Val 
145 150 155 160 

Pro Pro Ala Leu Leu Arg His Leu Phe Asn Cys Gly Arg Pro Gly Asn 

165 170 175 

Ser Pro Met Ala Arg Leu Leu Pro Thr Met Cys He Gin Ala Ser Glu 

180 185 190 

Gly Lys Asp Ser Ser Val Ala Ala Leu Leu Gin Lys Ala Glu Pro His 

195 200 205 

Asn Leu Gin He Thr Ala Ala Phe Leu Ala Gly Leu Leu Ser Arg Glu 

210 215 220 

His Trp Gly Leu Leu Ala Glu Cys Gin Thr Ser Glu Lys Ala Leu Leu 
225 230 235 240 

Arg Arg Gin Ala Cys Ala Arg Trp Cys Leu Ala Arg Ser Leu Arg Lys 

245 250 255 

His Phe His Ser He Pro Pro Ala Ala Pro Gly Glu Ala Lys Ser Val 

260 265 270 

His Ala Met Pro Gly Phe He Trp Leu He Arg Ser Leu Tyr Glu Met 

275 280 285 

Gin Glu Glu Arg Leu Ala Arg Lys Ala Ala Arg Gly Leu Asn Val Gly 

290 295 300 

His Leu Lys Leu Thr Phe Cys Ser Val Gly Pro Thr Glu Cys Ala Ala 
305 310 315 320 

Leu Ala Phe Val Leu Gin His Leu Arg Arg Pro Val Ala Leu Gin Leu 



325 



330 



335 
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Asp 


Tyr Asn 


Ser 


Val 


Gly Asp 


lie 


Gly Val 


Glu 


Gin 


Leu 


Leu 


Pro Cys 






340 








345 








350 




Leu 


Gly Val 


Cys 


Lys 


Ala Leu 


Tyr 


Leu Arg 


Asp Asn 


Asn 


He 


Ser Asp 




355 








360 








365 






Arg 


Gly lie 


Cys 


Lys 


Leu lie 


Glu 


Cys Ala 


Leu 


His 


Cys 


Glu 


Gin Leu 




370 






375 








380 








Gin 


Lys Leu 


Ala 


Leu 


Gly Asn Asn 


Tyr He 


Thr 


Ala 


Ala 


Gly Ala Gin 


385 








390 






395 








400 



Val Leu Ala 



<210> 177 
<211> 261 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (261) 

<400> 177 

atg aat ttc ata aag gac aat age cga gec ctt att caa aga atg gga 48 
Met Asn Phe He Lys Asp Asn Ser Arg Ala Leu He Gin Arg Met Gly 
15 10 15 

atg act gtt ata aag caa ate aca gat gac eta ttt gta tgg aat gtt 96 
Met Thr Val He Lys Gin He Thr Asp Asp Leu Phe Val Trp Asn Val 
20 25 30 

ctg aat cgc gaa gaa gta aac ate att tgc tgc gag aag gtg gag cag 144 
Leu Asn Arg Glu Glu Val Asn He He Cys Cys Glu Lys Val Glu Gin 
35 40 45 

gat get get aga ggg ate att cac atg att ttg aaa aag ggt tea gag 192 
Asp Ala Ala Arg Gly He He His Met He Leu Lys Lys Gly Ser Glu 
50 55 60 

tec tgt aac etc ttt ctt aaa tec ctt aag gag tgg aac tat cct eta 240 
Ser Cys Asn Leu Phe Leu Lys Ser Leu Lys Glu Trp Asn Tyr Pro Leu 
65 70 75 80 

ttt cag gac ttg aat gga caa 261 
Phe Gin Asp Leu Asn Gly Gin 
85 



<210> 178 

<211> 87 

<212> PRT 

<213> Homo sapiens 

<400> 178 

Met Asn Phe He Lys Asp Asn Ser Arg Ala Leu He Gin Arg Met Gly 
1 5 10 15 
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iviet inr vai lie Lys 


Gin 


He 


Thr 


Asp 


Asp 


Leu 


Phe Val Trp Asn Val 


20 








25 






30 


Leu Asn Arg Glu Glu 


Val 


Asn 


He 


He 


Cys 


Cys 


Glu Lys Val Glu Gin 


35 






40 








45 


Asp Ala Ala Arg Gly 


He 


He 


His 


Met 


He 


Leu 


Lys Lys Gly Ser Glu 


50 




55 










60 


Ser Cys Asn Leu Phe 


Leu 


Lys 


Ser 


Leu 


Lys 


Glu 


Trp Asn Tyr Pro Leu 


65 


70 










75 


80 


Phe Gin Asp Leu Asn 


Gly 


Gin 













85 



<210> 179 
<211> 891 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (891) 

<400> 179 

ctt cag age ccc tgc ate att gaa ggg gaa tct ggc aaa ggc aag tec 48 
Leu Gin Ser Pro Cys He He Glu Gly Glu Ser Gly Lys Gly Lys Ser 
15 10 15 

act ctg ctg cag cgc att gec atg etc tgg ggc tec gga aag tgc aag 96 
Thr Leu Leu Gin Arg He Ala Met Leu Trp Gly Ser Gly Lys Cys Lys 
20 25 30 

get ctg acc aag ttc aaa ttc gtc ttc ttc etc cgt etc age agg gee 144 
Ala Leu Thr Lys Phe Lys Phe Val Phe Phe Leu Arg Leu Ser Arg Ala 
35 40' 45 

cag ggt gga ctt ttt gaa acc etc tgt gat caa etc ctg gat ata cct 192 
Gin Gly Gly Leu Phe Glu Thr Leu Cys Asp Gin Leu Leu Asp He Pro 
50 55 60 

ggc aca ate agg aag cag aca ttc atg gec atg ctg ctg aag ctg egg 240 
Gly Thr He Arg Lys Gin Thr Phe Met Ala Met Leu Leu Lys Leu Arg 
65 70 75 80 

cag agg gtt ctt ttc ctt ctt gat ggc tac aat gaa ttc aag ccc cag 288 
Gin Arg Val Leu Phe Leu Leu Asp Gly Tyr Asn Glu Phe Lys Pro Gin 
85 90 95 

aac tgc cca gaa ate gaa gee ctg ata aag gaa aac cac cgc ttc aag 336 
Asn Cys Pro Glu lie Glu Ala Leu He Lys Glu Asn His Arg Phe Lys 
100 105 110 

aac atg gtc ate gtc acc act acc act gag tgc ctg agg cac ata egg 384 
Asn Met Val He Val Thr Thr Thr Thr Glu Cys Leu Arg His He Arg 
115 120 125 

cag ttt ggt gec ctg act get gag gtg ggg gat atg aca gaa gac age 432 
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Gin Phe Gly Ala Leu Thr Ala Glu Val Gly Asp Met Thr Glu Asp Ser 
130 135 140 

gcc cag get etc ate cga gaa gtg ctg ate aag gag ctt get gaa ggc 480 
Ala Gin Ala Leu He Arg Glu Val Leu He Lys Glu Leu Ala Glu Gly 
145 150 155 160 

ttg ttg etc caa att cag aaa tec agg tgc ttg agg aat etc atg aag 528 
Leu Leu Leu Gin He Gin Lys Ser Arg Cys Leu Arg Asn Leu Met Lys 
165 170 175 

acc cct etc ttt gtg gtc ate act tgt gca ate cag atg ggt gaa agt 576 
Thr Pro Leu Phe Val Val He Thr Cys Ala He Gin Met Gly Glu Ser 
180 185 190 

gag ttc cac tct cac aca caa aca acg ctg ttc cat acc ttc tat gat 624 
Glu Phe His Ser His Thr Gin Thr Thr Leu Phe His Thr Phe Tyr Asp 
195 200 205 

ctg ttg ata cag aaa aac aaa cac aaa cat aaa ggt gtg get gca agt 6 72 
Leu Leu He Gin Lys Asn Lys His Lys His Lys Gly Val Ala Ala Ser 
210 215 220 

gac ttc att egg age ctg gac cac cgt gga gac eta get ctg gag ggt 72 0 
Asp Phe He Arg Ser Leu Asp His Arg Gly Asp Leu Ala Leu Glu Gly 
225 230 235 240 

gtg ttc tec cac aag ttt gat ttc gaa ctg cag gat gtg tec age gtg 768 
Val Phe Ser His Lys Phe Asp Phe Glu Leu Gin Asp Val Ser Ser Val 
245 250 255 

aat gag gat gtc ctg ctg aca act ggg etc etc tgt aaa tat aca get 816 
Asn Glu Asp Val Leu Leu Thr Thr Gly Leu Leu Cys Lys Tyr Thr Ala 
260 265 270 

caa agg ttc aag cca aag tat aaa ttc ttt cac aag tea ttc cag gag 864 
Gin Arg Phe Lys Pro Lys Tyr Lys Phe Phe His Lys Ser Phe Gin Glu 
275 280 285 

tac aca gca gga cga aga etc age agt 891 
Tyr Thr Ala Gly Arg Arg Leu Ser Ser 
290 295 



<210> 180 

<211> 297 

<212> PRT 

<213> Homo sapiens 



<400> 180 



Leu 


Gin 


Ser 


Pro 


Cys 


He 


He 


Glu 


Gly 


Glu 


Ser 


Gly 


Lys 


Gly 


Lys 


Ser 


1 








5 










10 










15 




Thr 


Leu 


Leu 


Gin 


Arg 


He 


Ala 


Met 


Leu 


Trp 


Gly 


Ser 


Gly 


Lys 


Cys 


Lys 








20 










25 










30 






Ala 


Leu 


Thr 


Lys 


Phe 


Lys 


Phe 


Val 


Phe 


Phe 


Leu 


Arg 


Leu 


Ser 


Arg 


Ala 
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35 










40 


45 








Gin 


Gly 


Gly Leu Phe 


Glu 


Thr 


Leu Cys 


Asp Gin Leu Leu 


Asp 


He 


Pro 




50 










55 




60 








Gly 


Thr 


He 


Arg 


Lys 


Gin 


Thr 


Phe Met 


Ala Met Leu Leu 


Lys 


Leu 


Arg 


65 










70 






75 






80 


Gin 


Arg 


Val 


Leu 


Phe 


Leu 


Leu Asp Gly 


Tyr Asn Glu Phe 


Lys 


Pro 


Gin 










85 








90 




95 




Asn 


Cys 


Pro 


Glu 


He 


Glu 


Ala 


Leu He 


Lys Glu Asn His 


Arg 


Phe 


Lys 








100 








105 




110 






Asn 


Met 


Val 


He 


Val 


Thr 


Thr 


Thr Thr 


Glu Cys Leu Arg 


His 


He 


Arg 






115 










120 


125 








Gin 


Phe 


Gly Ala Leu 


Thr 


Ala 


Glu Val 


Gly Asp Met Thr 


Glu Asp 


Ser 




13 0 










135 




140 








Ala 


Gin 


Ala 


Leu 


He 


Arg 


Glu 


Val Leu 


He Lys Glu Leu 


Ala 


Glu 


Gly 


145 










150 






155 






160 


Leu 


Leu 


Leu 


Gin 


He 


Gin 


Lys 


Ser Arg 


Cys Leu Arg Asn 


Leu 


Met 


Lys 










165 








170 




175 




Thr 


Pro 


Leu 


Phe 


Val 


Val 


He 


Thr Cys 


Ala He Gin Met 


Gly Glu 


Ser 








180 








185 




190 






Glu 


Phe 


His 


Ser 


His 


Thr 


Gin 


Thr Thr 


Leu Phe His Thr 


Phe 


Tyr 


Asp 






195 










200 


205 








Leu 


Leu 


He 


Gin 


Lys 


Asn 


Lys 


His Lys 


His Lys Gly Val 


Ala 


Ala 


Ser 




210 










215 




220 








Asp 


Phe 


He 


Arg 


Ser 


Leu Asp His Arg 


Gly Asp Leu Ala 


Leu 


Glu 


Gly 


225 










230 






235 






240 


Val 


Phe 


Ser 


His 


Lys 


Phe 


Asp 


Phe Glu 


Leu Gin Asp Val 


Ser 


Ser 


Val 










245 








250 




255 




Asn 


Glu 


Asp 


Val 


Leu 


Leu 


Thr 


Thr Gly 


Leu Leu Cys Lys 


Tyr 


Thr 


Ala 








260 








265 




270 






Gin 


Arg 


Phe 


Lys 


Pro 


Lys 


Tyr 


Lys Phe 


Phe His Lys Ser 


Phe 


Gin 


Glu 






275 










280 


285 








Tyr 


Thr 


Ala 


Gly Arg 


Arg 


Leu 


Ser Ser 












290 










295 













<210> 181 
<211> 618 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (618) 

<400> 181 

ggt aac ttg aag aac ctt aca aag etc ata atg gat aac ata aag atg 48 
Gly Asn Leu Lys Asn Leu Thr Lys Leu He Met Asp Asn lie Lys Met 
15 10 15 

aat gaa gaa gat get ata aaa eta get gaa ggc ctg aaa aac ctg aag 96 
Asn Glu Glu Asp Ala He Lys Leu Ala Glu Gly Leu Lys Asn Leu Lys 
20 25 30 

aag atg tgt tta ttt cat ttg acc cac ttg tct gac att gga gag gga 144 
Lys Met Cys Leu Phe His Leu Thr His Leu Ser Asp He Gly Glu Gly 
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35 40 45 

atg gat tac ata gtc aag tct ctg tea agt gaa ccc tgt gac ctt gaa 192 

Met Asp Tyr lie Val Lys Ser Leu Ser Ser Glu Pro Cys Asp Leu Glu 
50 55 60 



gaa att caa tta gtc tec tgc tgc ttg tct gca aat gca gtg aaa ate 
Glu lie Gin Leu Val Ser Cys Cys Leu Ser Ala Asn Ala Val Lys lie 
65 70 75 80 



240 



eta get cag aat ctt cac aat ttg gtc aaa ctg age att ctt gat tta 288 

Leu Ala Gin Asn Leu His Asn Leu Val Lys Leu Ser lie Leu Asp Leu 
85 90 95 

tea gaa aat tac ctg gaa aaa gat gga aat gaa get ctt cat gaa ctg 336 

Ser Glu Asn Tyr Leu Glu Lys Asp Gly Asn Glu Ala Leu His Glu Leu 

100 105 110 

ate gac agg atg aac gtg eta gaa cag etc ace gca ctg atg ctg ccc 384 

He Asp Arg Met Asn Val Leu Glu Gin Leu Thr Ala Leu Met Leu Pro 

115 120 125 

tgg 99 c tgt 9 ac gtg caa ggc age ctg age age ctg ttg aaa cat ttg 432 

Trp Gly Cys Asp Val Gin Gly Ser Leu Ser Ser Leu Leu Lys His Leu 
130 135 140 

gag gag gtc cca caa etc gtc aag ctt ggg ttg aaa aac tgg aga etc 480 

Glu Glu Val Pro Gin Leu Val Lys Leu Gly Leu Lys Asn Trp Arg Leu 
145 150 155 160 

aca gat aca gag att aga att tta ggt gca ttt ttt gga aag aac cct 528 

Thr Asp Thr Glu He Arg He Leu Gly Ala Phe Phe Gly Lys Asn Pro 
165 170 175 

ctg aaa aac ttc cag cag ttg aat ttg gcg gga aat cgt gtg age agt 576 

Leu Lys Asn Phe Gin Gin Leu Asn Leu Ala Gly Asn Arg Val Ser Ser 

180 185 190 

gat gga tgg ctt gee ttc atg ggt gta ttt gag aat ctt aag 618 

Asp Gly Trp Leu Ala Phe Met Gly Val Phe Glu Asn Leu Lys 

195 200 205 



<210> 182 
<211> 206 
<212> PRT 

<213> Homo sapiens 
<400> 182 

Gly Asn Leu Lys Asn Leu Thr Lys Leu He Met Asp Asn He Lys Met 

15 10 15 

Asn Glu Glu Asp Ala He Lys Leu Ala Glu Gly Leu Lys Asn Leu Lys 

20 25 30 

Lys Met Cys Leu Phe His Leu Thr His Leu Ser Asp He Gly Glu Gly 
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35 




40 




45 








Met Asp 


Tyr lie Val Lys 


Ser 


Leu Ser Ser Glu 


Pro 


Cys 


Asp 


Leu 


Glu 


50 




55 




60 










Glu lie 


Gin Leu Val Ser 


Cys 


Cys Leu Ser Ala 


Asn 


Ala 


Val 


Lys 


He 


65 


70 




75 










80 


Leu Ala 


Gin Asn Leu His 


Asn 


Leu Val Lys Leu 


Ser 


He 


Leu Asp Leu 




85 




90 








95 




Ser Glu 


Asn Tyr Leu Glu Lys 


Asp Gly Asn Glu 


Ala 


Leu 


His 


Glu 


Leu 




100 




105 






110 






lie Asp 


Arg Met Asn Val 


Leu 


Glu Gin Leu Thr 


Ala 


Leu 


Met 


Leu 


Pro 




115 




120 




125 








Trp Gly 


Cys Asp Val Gin Gly 


Ser Leu Ser Ser 


Leu 


Leu 


Lys 


His 


Leu 


130 




135 




140 










Glu Glu 


Val Pro Gin Leu 


Val 


Lys Leu Gly Leu 


Lys 


Asn 


Trp Arg 


Leu 


145 


150 




155 










160 


Thr Asp 


Thr Glu lie Arg 


He 


Leu Gly Ala Phe 


Phe Gly 


Lys 


Asn 


Pro 




165 




170 








175 




Leu Lys 


Asn Phe Gin Gin 


Leu 


Asn Leu Ala Gly 


Asn Arg 


Val 


Ser 


Ser 




180 




185 






190 






Asp Gly 


Trp Leu Ala Phe 


Met 


Gly Val Phe Glu 


Asn 


Leu 


Lys 








195 




200 




205 









<210> 183 
<211> 165 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (165) 

<400> 183 

acc tac att ccc age agg get gta tct ttg ttc ttc aac tgg aag cag 4 8 
Thr Tyr He Pro Ser Arg Ala Val Ser Leu Phe Phe Asn Trp Lys Gin 
1 5 10 15 

gaa ttc agg act ctg gag gtc aca etc egg gat ttc age aag ttg aat 96 
Glu Phe Arg Thr Leu Glu Val Thr Leu Arg Asp Phe Ser Lys Leu Asn 
20 25 30 

aag caa gat ate aga tat ctg ggg aaa ata ttc age tct gec aca age 144 
Lys Gin Asp He Arg Tyr Leu Gly Lys He Phe Ser Ser Ala Thr Ser 
35 40 45 

etc agg ctg caa ata aag aga 165 
Leu Arg Leu Gin lie Lys Arg 
50 55 



<210> 184 

<211> 55 

<212> PRT 

<213> Homo sapiens 
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<400> 184 

Thr Tyr lie Pro Ser Arg Ala Val Ser Leu Phe Phe Asn Trp Lys Gin 

15 10 15 

Glu Phe Arg Thr Leu Glu Val Thr Leu Arg Asp Phe Ser Lys Leu Asn 

20 25 30 

Lys Gin Asp lie Arg Tyr Leu Gly Lys lie Phe Ser Ser Ala Thr Ser 

35 40 45 

Leu Arg Leu Gin lie Lys Arg 
50 55 



<210> 


135 




<211> 


19 




<212> 


DNA 




<213> 


Artificial Sequence 


<220> 






<223> 


primer 


<400> 


185 




gaaatgtgct 


cgcaggagg 


<210> 


186 




<211> 


20 




<212> 


DNA 




<213> 


Artificial Sequence 


<220> 






<223> 


primer 


<400> 


186 




gatgagcttc 


tgacaggccc 


<210> 


187 




<211> 


3063 




<212> 


DNA 




<213> 


Homo 


sapiens 


<220> 






<221> 


CDS 




<222> 


(1) . 


. . (2385) 


<221> 


CDS 




<222> 


(2389) . . . (2928) 


<400> 


187 





19 



20 



tgt gaa atg tgc teg cag gag get ttt cag gca cag agg age cag ctg 48 
Cys Glu Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu 
15 10 15 

gtc gag ctg ctg gtc tea ggg tec ctg gaa ggc ttc gag agt gtc ctg 96 
Val Glu Leu Leu Val Ser Gly Ser Leu Glu Gly Phe Glu Ser Val Leu 
20 25 30 

gac tgg ctg ctg tec tgg gag gtc etc tec tgg gag gac tac gag ggc 144 
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Asp Trp Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly 

35 40 45 

ttc cac etc ctg ggc cag cct etc tec cac ttg gee agg cgc ctt ctg 192 

Phe His Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu 
50 55 60 

gac acc gtc tgg aat aag ggt act tgg gec tgt cag aag etc ate gcg 240 

Asp Thr Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu lie Ala 
65 70 75 80 

get gec caa gaa gec cag gec gac age cag tec ccc aag ctg cat ggc 288 

Ala Ala Gin Glu Ala Gin Ala Asp Ser Gin Ser Pro Lys Leu His Gly 
85 90 95 

tgc tgg gac ccc cac teg etc cac cca gee cga gac ctg cag agt cac 336 

Cys Trp Asp Pro His Ser Leu His Pro Ala Arg Asp Leu Gin Ser His 

100 105 110 

egg cca gee att gtc agg agg etc cac age cat gtg gag aac atg ctg 384 

Arg Pro Ala lie Val Arg Arg Leu His Ser His Val Glu Asn Met Leu 

115 120 125 

gac ctg gca tgg gag egg ggt ttc gtc age cag tat gaa tgt gat gaa 43 2 

Asp Leu Ala Trp Glu Arg Gly Phe Val Ser Gin Tyr Glu Cys Asp Glu 
130 135 140 

ate agg ttg ccg ate ttc aca ccg tec cag agg gca aga agg ctg ctt 48 0 

lie Arg Leu Pro lie Phe Thr Pro Ser Gin Arg Ala Arg Arg Leu Leu 
145 150 155 160 

gat ctt gee acg gtg aaa gcg aat gga ttg get gec ttc ctt eta caa 52 8 

Asp Leu Ala Thr Val Lys Ala Asn Gly Leu Ala Ala Phe Leu Leu Gin 
165 170 175 

cat gtt cag gaa tta cca gtc cca ttg gee ctg cct ttg gaa get gee 576 

His Val Gin Glu Leu Pro Val Pro Leu Ala Leu Pro Leu Glu Ala Ala 

180 185 190 

aca tgc aag aag tat atg gec aag ctg agg acc acg gtg tct get cag 624 

Thr Cys Lys Lys Tyr Met Ala Lys Leu Arg Thr Thr Val Ser Ala Gin 

195 200 205 

tct cgc ttc etc agt acc tat gat gga gca gag acg etc tgc ctg gag 672 

Ser Arg Phe Leu Ser Thr Tyr Asp Gly Ala Glu Thr Leu Cys Leu Glu 
210 215 220 

gac ata tac aca gag aat gtc ctg gag gtc tgg gca gat gtg ggc atg 720 

Asp lie Tyr Thr Glu Asn Val Leu Glu Val Trp Ala Asp Val Gly Met 
225 230 235 240 

get gga ccc ccg cag aag age cca gec acc ctg ggc ctg gag gag etc 768 

Ala Gly Pro Pro Gin Lys Ser Pro Ala Thr Leu Gly Leu Glu Glu Leu 
245 250 255 
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ttc age acc cct ggc cac etc aat gac gat gcg gac act gtg ctg gtg 816 

Phe Ser Thr Pro Gly His Leu Asn Asp Asp Ala Asp Thr Val Leu Val 

260 265 270 

9^9 ggt gag gcg ggc agt ggc aag age acg etc ctg cag egg ctg cac 864 

Val Gly Glu Ala Gly Ser Gly Lys Ser Thr Leu Leu Gin Arg Leu His 

275 280 285 

ttg ctg tgg get gca ggg caa gac ttc cag gaa ttt etc ttt gtc ttc 912 

Leu Leu Trp Ala Ala Gly Gin Asp Phe Gin Glu Phe Leu Phe Val Phe 

290 295 300 

cca ttc age tgc egg cag ctg cag tgc atg gee aaa cca etc tct gtg 960 

Pro Phe Ser Cys Arg Gin Leu Gin Cys Met Ala Lys Pro Leu Ser Val 

305 310 315 320 

egg act eta etc ttt gag cac tgc tgt tgg cct gat gtt ggt caa gaa 1008 

Arg Thr Leu Leu Phe Glu His Cys Cys Trp Pro Asp Val Gly Gin Glu 

325 330 335 

gac ate ttc cag tta etc ctt gac cac cct gac cgt gtc ctg tta acc 1056 

Asp lie Phe Gin Leu Leu Leu Asp His Px~o Asp Arg Val Leu Leu Thr 

340 345 350 

ttt gat ggc ttt gac gag ttc aag ttc agg ttc acg gat cgt gaa cgc 1104 

Phe Asp Gly Phe Asp Glu Phe Lys Phe Arg Phe Thr Asp Arg Glu Arg 

355 360 365 

cac tgc tec ccg acc gac ccc acc tct gtc cag acc ctg etc ttc aac 1152 

His Cys Ser Pro Thr Asp Pro Thr Ser Val Gin Thr Leu Leu Phe Asn 

370 375 380 

ctt ctg cag ggc aac ctg ctg aag aat gec cgc aag gtg gtg acc age 1200 

Leu Leu Gin Gly Asn Leu Leu Lys Asn Ala Arg Lys Val Val Thr Ser 

385 390 395 400 

cgt ccg gec get gtg teg gcg ttc etc agg aag tac ate cgc acc gag 1248 

Arg Pro Ala Ala Val Ser Ala Phe Leu Arg Lys Tyr lie Arg Thr Glu 

405 410 415 

ttc aac etc aag ggc ttc tct gaa cag ggc ate gag ctg tac ctg agg 1296 

Phe Asn Leu Lys Gly Phe Ser Glu Gin Gly lie Glu Leu Tyr Leu Arg 

420 425 430 

aag cgc cat cat gag ccc ggg gtg gcg gac cgc etc ate cgc ctg etc 1344 

Lys Arg His His Glu Pro Gly Val Ala Asp Arg Leu lie Arg Leu Leu 

435 440 445 

caa gag acc tea gee ctg cac ggt ttg tgc cac ctg cct gtc ttc tea 1392 
Gin Glu Thr Ser Ala Leu His Gly Leu Cys His Leu Pro Val Phe Ser 

450 455 460 

tgg atg gtg tec aaa tgc cac cag gaa ctg ttg ctg cag gag ggg ggg 1440 
Trp Met Val Ser Lys Cys His Gin Glu Leu Leu Leu Gin Glu Gly Gly 

465 470 475 480 
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tec cca aag acc act aca gat atg tac ctg ctg att ctg cag cat ttt 1488 
Ser Pro Lys Thr Thr Thr Asp Met Tyr Leu Leu lie Leu Gin His Phe 
485 490 495 

ctg ctg cat gec acc ccc cca gac tea get tec caa ggt ctg gga ccc 1536 
Leu Leu His Ala Thr Pro Pro Asp Ser Ala Ser Gin Gly Leu Gly Pro 
500 505 510 

agt ctt ctt egg ggc cgc etc ccc acc etc ctg cac ctg ggc aga ctg 1584 
Ser Leu Leu Arg Gly Arg Leu Pro Thr Leu Leu His Leu Gly Arg Leu 
515 520 525 

get ctg tgg ggc ctg ggc atg tgc tgc tac gtg ttc tea gee cag cag 1632 
Ala Leu Trp Gly Leu Gly Met Cys Cys Tyr Val Phe Ser Ala Gin Gin 
530 535 540 

etc cag gca gca cag gtc age cct gat gac att tct ctt ggc ttc ctg 16 80 
Leu Gin Ala Ala Gin Val Ser Pro Asp Asp lie Ser Leu Gly Phe Leu 
545 550 555 560 

gtg cgt gee aaa ggt gtc gtg cca ggg agt acg gcg ccc ctg gaa ttc 1728 
Val Arg Ala Lys Gly Val Val Pro Gly Ser Thr Ala Pro Leu Glu Phe 
565 570 575 

ctt cac ate act ttc cag tgc ttc ttt gec gcg ttc tac ctg gca etc 1776 
Leu His lie Thr Phe Gin Cys Phe Phe Ala Ala Phe Tyr Leu Ala Leu 
580 585 590 

agt get gat gtg cca cca get ttg etc aga cac etc ttc aat tgt ggc 1824 
Ser Ala Asp Val Pro Pro Ala Leu Leu Arg His Leu Phe Asn Cys Gly 
595 600 605 

agg cca ggc aac tea cca atg gee agg etc ctg ccc acg atg tgc ate 18 72 
Arg Pro Gly Asn Ser Pro Met Ala Arg Leu Leu Pro Thr Met Cys lie 
610 615 620 

cag gec teg gag gga aag gac age age gtg gca get ttg ctg cag aag 1920. 
Gin Ala Ser Glu Gly Lys Asp Ser Ser Val Ala Ala Leu Leu Gin Lys 
625 630 635 640 

gee gag ccg cac aac ctt cag ate aca gca gee ttc ctg gca ggg ctg 1968 
Ala Glu Pro His Asn Leu Gin lie Thr Ala Ala Phe Leu Ala Gly Leu 
645 650 655 

ttg tec egg gag cac tgg ggc ctg ctg get gag tgc cag aca tct gag 2016 
Leu Ser Arg Glu His Trp Gly Leu Leu Ala Glu Cys Gin Thr Ser Glu 
660 665 670 

aag gec ctg etc egg cgc cag gee tgt gec cgc tgg tgt ctg gec cgc 2064 
Lys Ala Leu Leu Arg Arg Gin Ala Cys Ala Arg Trp Cys Leu Ala Arg 
675 680 685 

age etc cgc aag cac ttc cac tec ate ccg cca get gca ccg ggt gag 2112 
Ser Leu Arg Lys His Phe His Ser lie Pro Pro Ala Ala Pro Gly Glu 
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690 695 700 

gcc aag age gtg cat gec atg ccc ggg ttc ate tgg etc ate egg age 2160 
Ala Lys Ser Val His Ala Met Pro Gly Phe lie Trp Leu lie Arg Ser 
705 710 715 720 

ctg tac gag atg cag gag gag egg ctg get egg aag get gca cgt ggc 22 08 
Leu Tyr Glu Met Gin Glu Glu Arg Leu Ala Arg Lys Ala Ala Arg Gly 
725 730 735 

ctg aat gtt ggg cac etc aag ttg aca ttt tgc agt gtg ggc ccc act 2256 
Leu Asn Val Gly His Leu Lys Leu Thr Phe Cys Ser Val Gly Pro Thr 
740 745 750 

gag tgt get gcc ctg gcc ttt gtg ctg cag cac etc egg egg ccc gtg 2304 
Glu Cys Ala Ala Leu Ala Phe Val Leu Gin His Leu Arg Arg Pro Val 
755 760 765 

gcc ctg cag ctg gac tac aac tct gtg ggt gac att ggc gtg gag cag 2352 
Ala Leu Gin Leu Asp Tyr Asn Ser Val Gly Asp lie Gly Val Glu Gin 
770 775 780 

ctg ctg cct tgc ctt ggt gtc tgc aag get ctg taa ttc tgg ggc aac 2400 
Leu Leu Pro Cys Leu Gly Val Cys Lys Ala Leu Phe Trp Gly Asn 
785 790 795 

aga gtg ggt gac gag ggg gcc cag gcc ctg get gaa gcc ttg ggt gat 2448 
Arg Val Gly Asp Glu Gly Ala Gin Ala Leu Ala Glu Ala Leu Gly Asp 
800 805 810 815 

cac cag age ttg agg tgg etc age ctg gtg ggg aac aac att ggc agt 2496 
His Gin Ser Leu Arg Trp Leu Ser Leu Val Gly Asn Asn lie Gly Ser 
820 825 830 

gtg ggt gcc caa gcc ttg gca ctg atg ctg gca aag aac gtc atg eta 2544 
Val Gly Ala Gin Ala Leu Ala Leu Met Leu Ala Lys Asn Val Met Leu 
835 840 845 

gaa gaa etc tgc ctg gag gag aac cat etc cag gat gaa ggt gta tgt 2 5 92 
Glu Glu Leu Cys Leu Glu Glu Asn His Leu Gin Asp Glu Gly Val Cys 
850 855 860 

tct etc gca gaa gga ctg aag aaa aat tea agt ttg aaa ate ctg aac 264 0 
Ser Leu Ala Glu Gly Leu Lys Lys Asn Ser Ser Leu Lys lie Leu Asn 
865 870 875 

ata aaa att cat get teg gga ttc aac aaa etc ttg gaa age att ttc 2688 
lie Lys lie His Ala Ser Gly Phe Asn Lys Leu Leu Glu Ser lie Phe 
880 885 890 895 

tgc ate etc ctg gtt gtg gaa gca ttt ttc ctg cag aaa gtt gtc aag 2736 
Cys lie Leu Leu Val Val Glu Ala Phe Phe Leu Gin Lys Val Val Lys 
900 905 910 

att ctt gaa gaa atg gta gtc agt tgg eta gag gtc agg ttg tec aat 2784 
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He Leu Glu Glu Met Val Val Ser Trp Leu Glu Val Arg Leu Ser Asn 
915 920 925 

aac tgc ate acc tac eta ggg gca gaa gec etc ctg cag gee ctt gaa 2832 
Asn Cys He Thr Tyr Leu Gly Ala Glu Ala Leu Leu Gin Ala Leu Glu 
930 935 940 

agg aat gac acc ate ctg gaa gtc tgg etc cga ggg aac act ttc tct 2880 
Arg Asn Asp Thr He Leu Glu Val Trp Leu Arg Gly Asn Thr Phe Ser 
945 950 955 

eta gag gag gtt gac aag etc ggc tgc agg gac acc aga etc ttg ctt 2928 
Leu Glu Glu Val Asp Lys Leu Gly Cys Arg Asp Thr Arg Leu Leu Leu 
960 965 970 975 

tgaagtctcc gggaggatgt tegtctcagt ttgtttgtga gcaggctgtg agtttgggcc 2988 
ccagaggctg ggtgacatgt gttggcagcc tcttcaaaat gagccctgtc ctgectaagg 3048 
ctgaacttgt tttct 3063 

<210> 188 
<211> 795 
<212> PRT 
<213> Homo sapiens 

<400> 188 

Cys Glu Met Cys Ser Gin Glu Ala Phe Gin Ala Gin Arg Ser Gin Leu 

15 io 15 

Val Glu Leu Leu Val Ser Gly Ser Leu Glu Gly Phe Glu Ser Val Leu 

20 25 30 

Asp Trp Leu Leu Ser Trp Glu Val Leu Ser Trp Glu Asp Tyr Glu Gly 

35 40 45 

Phe His Leu Leu Gly Gin Pro Leu Ser His Leu Ala Arg Arg Leu Leu 

50 55 60 

Asp Thr Val Trp Asn Lys Gly Thr Trp Ala Cys Gin Lys Leu lie Ala 
65 70 75 80 

Ala Ala Gin Glu Ala Gin Ala Asp Ser Gin Ser Pro Lys Leu His Gly 

85 90 95 

Cys Trp Asp Pro His Ser Leu His Pro Ala Arg Asp Leu Gin Ser His 

100 105 110 

Arg Pro Ala He Val Arg Arg Leu His Ser His Val Glu Asn Met Leu 

115 120 125 

Asp Leu Ala Trp Glu Arg Gly Phe Val Ser Gin Tyr Glu Cys Asp Glu 

130 135 140 

He Arg Leu Pro He Phe Thr Pro Ser Gin Arg Ala Arg Arg Leu Leu 
145 150 155 160 

Asp Leu Ala Thr Val Lys Ala Asn Gly Leu Ala Ala Phe Leu Leu Gin 

165 170 175 

His Val Gin Glu Leu Pro Val Pro Leu Ala Leu Pro Leu Glu Ala Ala 

180 185 190 

Thr Cys Lys Lys Tyr Met Ala Lys Leu Arg Thr Thr Val Ser Ala Gin 

195 200 205 

Ser Arg Phe Leu Ser Thr Tyr Asp Gly Ala Glu Thr Leu Cys Leu Glu 

210 215 220 

Asp He Tyr Thr Glu Asn Val Leu Glu Val Trp Ala Asp Val Gly Met 
225 230 235 240 
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Ala Gly Pro Pro 

Phe Ser Thr Pro 
260 

Val Gly Glu Ala 
275 

Leu Leu Trp Ala 
290 

Pro Phe Ser Cys 
305 

Arg Thr Leu Leu 

Asp lie Phe Gin 
340 

Phe Asp Gly Phe 
355 

His Cys Ser Pro 
370 

Leu Leu Gin Gly 
385 

Arg Pro Ala Ala 

Phe Asn Leu Lys 
420 

Lys Arg His His 
435 

Gin Glu Thr Ser 
450 

Trp Met Val Ser 
465 

Ser Pro Lys Thr 

Leu Leu His Ala 
500 

Ser Leu Leu Arg 
515 

Ala Leu Trp Gly 
530 

Leu Gin Ala Ala 
545 

Val Arg Ala Lys 

Leu His lie Thr 
580 

Ser Ala Asp Val 
595 

Arg Pro Gly Asn 
610 

Gin Ala Ser Glu 
625 

Ala Glu Pro His 

Leu Ser Arg Glu 
660 

Lys Ala Leu Leu 



Gin Lys Ser Pro 
245 

Gly His Leu Asn 

Gly Ser Gly Lys 
280 

Ala Gly Gin Asp 
295 

Arg Gin Leu Gin 
310 

Phe Glu His Cys 
325 

Leu Leu Leu Asp 

Asp Glu Phe Lys 
360 

Thr Asp Pro Thr 
375 

Asn Leu Leu Lys 
390 

Val Ser Ala Phe 
405 

Gly Phe Ser Glu 

Glu Pro Gly Val 
440 

Ala Leu His Gly 
455 

Lys Cys His Gin 
470 

Thr Thr Asp Met 
485 

Thr Pro Pro Asp 

Gly Arg Leu Pro 
520 

Leu Gly Met Cys 
535 

Gin Val Ser Pro 
550 

Gly Val Val Pro 
565 

Phe Gin Cys Phe 

Pro Pro Ala Leu 
600 

Ser Pro Met Ala 
615 

Gly Lys Asp Ser 
630 

Asn Leu Gin lie 
645 

His Trp Gly Leu 
Arg Arg Gin Ala 



Ala Thr Leu Gly 
250 

Asp Asp Ala Asp 
265 

Ser Thr Leu Leu 

Phe Gin Glu Phe 
300 

Cys Met Ala Lys 
315 

Cys Trp Pro Asp 
330 

His Pro Asp Arg 
345 

Phe Arg Phe Thr 

Ser Val Gin Thr 
380 

Asn Ala Arg Lys 
395 

Leu Arg Lys Tyr 
410 

Gin Gly He Glu 
425 

Ala Asp Arg Leu 

Leu Cys His Leu 
460 

Glu Leu Leu Leu 
475 

Tyr Leu Leu He 
490 

Ser Ala Ser Gin 
505 

Thr Leu Leu His 

Cys Tyr Val Phe 
540 

Asp Asp He Ser 
555 

Gly Ser Thr Ala 
570 

Phe Ala Ala Phe 
585 

Leu Arg His Leu 

Arg Leu Leu Pro 
620 

Ser Val Ala Ala 
635 

Thr Ala Ala Phe 
650 

Leu Ala Glu Cys 
665 

Cys Ala Arg Trp 



- 65 - 



Leu Glu Glu Leu 
255 

Thr Val Leu Val 
270 

Gin Arg Leu His 
285 

Leu Phe Val Phe 

Pro Leu Ser Val 
320 

Val Gly Gin Glu 
335 

Val Leu Leu Thr 
350 

Asp Arg Glu Arg 
365 

Leu Leu Phe Asn 

Val Val Thr Ser 
400 

He Arg Thr Glu" 
415 

Leu Tyr Leu Arg 
430 

He Arg Leu Leu 
445 

Pro Val Phe Ser 

Gin Glu Gly Gly 
480 

Leu Gin His Phe 
495 

Gly Leu Gly Pro 
510 

Leu Gly Arg Leu 
525 

Ser Ala Gin Gin 

Leu Gly Phe Leu 
560 

Pro Leu Glu Phe 
575 

Tyr Leu Ala Leu 
590 

Phe Asn Cys Gly 
605 

Thr Met Cys He 

Leu Leu Gin Lys 
640 

Leu Ala Gly Leu 
655 

Gin Thr Ser Glu 
670 

Cys Leu Ala Arg 
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675 680 685 

Ser Leu Arg Lys His Phe His Ser lie Pro Pro Ala Ala Pro Gly Glu 

690 695 700 

Ala Lys Ser Val His Ala Met Pro Gly Phe lie Trp Leu He Arg Ser 
705 710 715 720 

Leu Tyr Glu Met Gin Glu Glu Arg Leu Ala Arg Lys Ala Ala Arg Gly 

725 730 735 

Leu Asn Val Gly His Leu Lys Leu Thr Phe Cys Ser Val Gly Pro Thr 

740 745 750 

Glu Cys Ala Ala Leu Ala Phe Val Leu Gin His Leu Arg Arg Pro Val 

755 760 765 

Ala Leu Gin Leu Asp Tyr Asn Ser Val Gly Asp He Gly Val Glu Gin 

770 775 780 

Leu Leu Pro Cys Leu Gly Val Cys Lys Ala Leu 
785 790 795 



<210> 189 
<211> 180 
<212> PRT 
<213> Homo sapiens 

<400> 189 



Phe 


Trp 


Gly 


Asn 


Arg Val 


Gly Asp 


Glu Gly Ala 


Gin 


Ala 


Leu 


Ala 


Glu 


1 








5 








10 








15 




Ala 


Leu 


Gly 


Asp 


His 


Gin 


Ser 


Leu 


Arg Trp Leu 


Ser 


Leu 


Val 


Gly 


Asn 








20 










25 






30 






Asn 


He 


Gly 


Ser 


Val 


Gly Ala Gin 


Ala Leu Ala 


Leu 


Met 


Leu 


Ala 


Lys 






35 










40 






45 








Asn 


Val 


Met 


Leu 


Glu 


Glu 


Leu 


Cys 


Leu Glu Glu 


Asn 


His 


Leu 


Gin 


Asp 




50 










55 






60 










Glu 


Gly 


Val 


Cys 


Ser 


Leu 


Ala 


Glu 


Gly Leu Lys 


Lys 


Asn 


Ser 


Ser 


Leu 


65 










70 






75 










80 


Lys 


He 


Leu 


Asn 


He 


Lys 


He 


His 


Ala Ser Gly 


Phe 


Asn 


Lys 


Leu 


Leu 










85 








90 








95 




Glu 


Ser 


He 


Phe 


Cys 


lie 


Leu 


Leu 


Val Val Glu 


Ala 


Phe 


Phe 


Leu 


Gin 








100 










105 






110 






Lys 


Val 


Val 


Lys 


He 


Leu 


Glu 


Glu 


Met Val Val 


Ser 


Trp 


Leu 


Glu 


Val 






115 










120 






125 








Arg 


Leu 


Ser 


Asn 


Asn 


Cys 


He 


Thr 


Tyr Leu Gly 


Ala 


Glu 


Ala 


Leu 


Leu 




130 










135 






140 










Gin 


Ala 


Leu 


Glu 


Arg 


Asn 


Asp 


Thr 


He Leu Glu 


Val 


Trp Leu Arg 


Gly 


145 










150 






155 










160 


Asn 


Thr 


Phe 


Ser 


Leu 


Glu 


Glu 


Val 


Asp Lys Leu 


Gly Cys Arg Asp 


Thr 










165 








170 








175 




Arg 


Leu 


Leu 


Leu 























180 



<210> 190 
<211> 721 
<212> DNA 
<213> Mus musculus 

<220> 
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<221> CDS 

<222> (193) . . . (612) 
<400> 190 

cctggggttc ctgcacatta ccttccgtgc ttttttgccg ctttctactt ggctgtcagt 60 
gctgacacat cggtggcctc tctcaagcac cttttcagct gtggccggct gggcagctca 12 0 
ctgctgggaa ggctgctgcc caacctgtgt atccagggct ccagagtcaa gaagggcagc 18 0 
gaagcagccc tg ctg cag aag get gag cca cac aac ctg caa ate aca gca 231 
Leu Gin Lys Ala Glu Pro His Asn Leu Gin He Thr Ala 
15 10 

gec ttc eta gca ggt ctg ttg tec cag cag cat egg gac ctg ttg get 279 
Ala Phe Leu Ala Gly Leu Leu Ser Gin Gin His Arg Asp Leu Leu Ala 
15 20 25 

gca tgc cag gtc tec gag agg gta ctg etc cag cgt cag gca cgt gee 327 
Ala Cys Gin Val Ser Glu Arg Val Leu Leu Gin Arg Gin Ala Arg Ala 
30 35 40 45 

cgc teg tgt ctg gec cac age etc cgc gag cac ttc cat tec ate ccg 375 
Arg Ser Cys Leu Ala His Ser Leu Arg Glu His Phe His Ser He Pro 
50 55 60 

cct gee gtg ccc ggt gag ace aag age atg cat get atg ccg ggc ttc 423 
Pro Ala Val Pro Gly Glu Thr Lys Ser Met His Ala Met Pro Gly Phe 
65 70 75 

att tgg etc ate cgt age ctg tac gag atg cag gag gag cag ttg gec 4 71 
He Trp Leu He Arg Ser Leu Tyr Glu Met Gin Glu Glu Gin Leu Ala 
80 85 90 

cag gag get gtc cgt cgc ttg gac ate ggg cac ctg aag ttg aca ttt 519 
Gin Glu Ala Val Arg Arg Leu Asp He Gly His Leu Lys Leu Thr Phe 
95 100 105 

tgc aga gtg ggc cct gca gag tgt get gca ctg gee ttt gta ctg caa 567 
Cys Arg Val Gly Pro Ala Glu Cys Ala Ala Leu Ala Phe Val Leu Gin 
HO 115 120 125 

cat etc cag egg cct gtg gee eta cag ctg gat tac aac tct gtg 612 
His Leu Gin Arg Pro Val Ala Leu Gin Leu Asp Tyr Asn Ser Val 
130 135 140 

ggagatgttg ggagtggaac agctgcgacc gtgcctttgg ggtctgeaca gctctgtagt 6 72 
gagtgtgaca aggtcttgee gattgggect gtggcaaatg ctactgtca 721 

<210> 191 
<211> 140 
<212> PRT 

<213> Mus musculus 
<400> 191 

Leu Gin Lys Ala Glu Pro His Asn Leu Gin He Thr Ala Ala Phe Leu 

15 io is 

Ala Gly Leu Leu Ser Gin Gin His Arg Asp Leu Leu Ala Ala Cys Gin 
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Val Ser Glu 
35 

Leu Ala His 
50 

Pro Gly Glu 
65 

lie Arg Ser 

Val Arg Arg 

Gly Pro Ala 
115 

Arg Pro Val 
130 



<210> 192 
<211> 419 
<212> DNA 
<213> Mus mus cuius 

<220> 

<221> CDS 

<222> (1) . . . (417) 

<400> 192 

ctg cag aag get gag cca cac aac ctg cag ate aca gca gec ttc eta 4 8 
Leu Gin Lys Ala Glu Pro His Asn Leu Gin lie Thr Ala Ala Phe Leu 
15 10 15 

gca ggt ctg ttg tec cag cag cat egg gac ctg ttg get gca tgc cag 96 
Ala Gly Leu Leu Ser Gin Gin His Arg Asp Leu Leu Ala Ala Cys Gin 
20 25 30 

ate tec gag agg gtg ctg etc cag cgt cag gca cgt gee cgc teg tgt 144 
lie Ser Glu Arg Val Leu Leu Gin Arg Gin Ala Arg Ala Arg Ser Cys 
35 40 45 

ctg gee cac age etc cgc gag cac ttc cat tec ate ccg cct gee gtg 192 
Leu Ala His Ser Leu Arg Glu His Phe His Ser lie Pro Pro Ala Val 
50 55 60 

ccc ggt gag ace aag age atg cat get atg ccg ggc ttt att tgg etc 240 
Pro Gly Glu Thr Lys Ser Met His Ala Met Pro Gly Phe lie Trp Leu 
65 70 75 80 

ate egg age ctg tac gag atg cag gag gag cag ttg gee cag gag get 288 
lie Arg Ser Leu Tyr Glu Met Gin Glu Glu Gin Leu Ala Gin Glu Ala 
85 90 95 

gtc cgt cgc ttg gac ate ggg cac ctg aag ttg aca ttt tgc aga gtg 336 
Val Arg Arg Leu Asp lie Gly His Leu Lys Leu^ Thr Phe Cys Arg Val 
100 105 110 



20 25 
Arg Val Leu Leu Gin Arg Gin 
40 

Ser Leu Arg Glu His Phe His 
55 

Thr Lys Ser Met His Ala Met 
70 

Leu Tyr Glu Met Gin Glu Glu 

85 90 
Leu Asp lie Gly His Leu Lys 
100 105 
Glu Cys Ala Ala Leu Ala Phe 
120 

Ala Leu Gin Leu Asp Tyr Asn 
135 



30 

Ala Arg Ala Arg Ser Cys 
45 

Ser He Pro Pro Ala Val 
60 

Pro Gly Phe He Trp Leu 
75 80 
Gin Leu Ala Gin Glu Ala 
95 

Leu Thr Phe Cys Arg Val 
110 

Val Leu Gin His Leu Gin 
125 

Ser Val 
140 
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ggc cct gca gag tgt get gcg ctg gec ttt gta ctg caa cat etc cag 3 84 

Gly Pro Ala Giu Cys Ala Ala Leu Ala Phe Val Leu Gin His Leu Gin 
115 120 125 

egg cct gtg gee eta cag ctg gat tac aac tct gt 419 

Arg Pro Val Ala Leu Gin Leu Asp Tyr Asn Ser 
130 135 



<210> 193 
<211> 139 
<212> PRT 

<213> Mus musculus 
<400> 193 

Leu Gin Lys Ala Glu Pro His Asn Leu Gin He Thr Ala Ala Phe Leu 

15 10 15 

Ala Gly Leu Leu Ser Gin Gin His Arg Asp Leu Leu Ala Ala Cys Gin 

20 25 30 

He Ser Glu Arg Val Leu Leu Gin Arg Gin Ala Arg Ala Arg Ser Cys 

35 40 45 

Leu Ala His Ser Leu Arg Glu His Phe His Ser He Pro Pro Ala Val 

50 55 60 

Pro Gly Glu Thr Lys Ser Met His Ala Met Pro Gly Phe He Trp Leu 
65 70 75 80 

He Arg Ser Leu Tyr Glu Met Gin Glu Glu Gin Leu Ala Gin Glu Ala 

85 90 95 

Val Arg Arg Leu Asp He Gly His Leu Lys Leu Thr Phe Cys Arg Val 

100 105 110 

Gly Pro Ala Glu Cys Ala Ala Leu Ala Phe Val Leu Gin His Leu Gin 

115 120 125 

Arg Pro Val Ala Leu Gin Leu Asp Tyr Asn Ser 
130 135 



<210> 194 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 194 

ctgeagaagg ctgagccaca caacct 2 6 

<210> 195 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 195 
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acagagttgt aatccagctg tagggccaca 3 0 
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